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Abstract. Let /i, . . . , /t be positive definite binary quadratic forms, and let Rf . (n) = 
\{{x,y) : fi(x,y) = n}\ denote the corresponding representation functions. Employing 
methods developed by Green and Tao, we deduce asymptotics for linear correlations 
of these representation functions. More precisely, we study the expression 



i=l 

where the i/jj form a system of affine linear forms no two of which are affinely related, 
and where K is a, convex body. 

The minor arc analysis builds on the observation that polynomial subsequences 
of equidistributed nilsequences are still equidistributed, an observation that could be 
useful in treating the minor arcs of other arithmetic questions. 

As a very quick application we give asymptotics to the number of simultaneous 
zeros of certain systems of quadratic equations in 8 or more variables. 
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1. Introduction 

The distribution of prime numbers shares many properties with the distribution of 
numbers that are representable as a sum of two squares, an analogy that is occasionally 
employed to obtain model problems for questions about the primes. Let us consider the 
distributions of the two sets in arithmetic progressions. Starting with the average orders, 
we have on the one hand the Prime Number Theorem, asserting that 7t(x) ~ For 

' =5 \ ' lOgX 

the set 5* of sums of two squares on the other hand. Landau p/7j proved an analogous 
asymptotic in 1908, namely 

where i? = ^np.3(mod4)(l-P~')^'/'- 

Turning towards more general arithmetic progressions, let a and q be coprime in- 
tegers, then the primes congruent to a (modg) satisfy 7r(a;; g, a) ~ ^{qyiogx - Building 
on Landau's result and the analogy to primes, Prachar [20] proved in the 1950s that, 
when furthermore a = 1 (mod gcd(4, q)) holds, sums of two squares show the following 
behaviouiQ 

n=a (modijr) 

where 

^ ^ p=3 (mod 4) 

The factor (4, g)/(2,g) describes that the density of sums of two squares is twice as 
high in the progression n = 1 (mod 4) as it is in = 1 (mod 2). From pairs (a, g) 
that are not coprime, one needs to remove those choices from consideration that lead 
to whole progressions entirely lying outside the set S. Examples are integers n such 
that n = 3 (mod 3^), which are never a sum of two squares, or, numbers of the form 
(3 ■ 5)n + 3^, which can only be a sum of two squares when 3|n. When excluding such 
classes a (modg), the constant Bq only needs to be adapted by restricting the product 
over p = 3 (mod 4) to primes dividing q/ gcd(a, q). 

Thus, both sets, the primes and the sums of two squares, show some uniformity in the 
distribution in residue classes once one excludes residue classes that for obvious reason 
contain too few elements. 

It is natural to ask whether this uniformity carries further: is, for instance, the 
distribution uniform enough to determine asymptotically the density of solutions to 
linear equations within these sets? More precisely, we are interested in an asymptotic 
for correlations of the form 

t 

where the ipi : Z'^ ^ Z are affine linear forms and where the arithmetic function 
/ : Z — >■ M is either the characteristic function I5 of sums of two squares, or it is 
chosen to be the characteristic function of primes. 

"'^This compact formulation of the result is due to Iwaniec |15) . 



LINEAR CORRELATIONS AMONGST NUMBERS WEIGHTED BY R 



3 



Green and Tao studied these correlations in the case of the primes in [8]. They 
replaced for this purpose the sparse set of primes by a weighted version of asymptotic 
density 1 which is given by the von Mangoldt function. We shall not normalise the 
characteristic function I5 in an analogous way, but instead consider the intrinsically 
weighted function that is given by the representation function of sums of two squares, 
R{n) = \{{x,y) G Z : x'^ + y'^ = ''^ll- Counting lattice points in a circle of radius \/N 
immediately shows that the representation function has indeed an asymptotic density 
given by ^„^^ R{n) ~ vrA^. As we will see, the nilpotent Hardy-Littlewood method 
which Green and Tao developed to handle linear correlations among the primes can also 
be employed in the case of the representation function R{n). 

Instead of restricting attention to representations as sums of two squares, the slightly 
more general case of representation by a positive definite binary quadratic form f{x, y) = 
ax"^ + bxy + cy^ will be considered. The corresponding representation function is then 
given by Rf{n) = \{{x,y) : f{x,y) = n}\. 



Notation. Throughout the paper, we write [N] for the set of numbers {1, . . . , A^} and 
e(x) for exp(27rx). We let : N — )• denote the p-adic valuation. If T is a finite set, 
we use the expectation notation EtgT to abbreviate Yltex- ^ linear correlation is 
defined along a system \I' = {ipi, . . . , ipt) : Z'^ — t- Z* of affine linear forms. Such a system 
may be written as ^(n) = \E'(0) + for a linear map \E^. We regard as fixed, 

while ^'(O) may, for instance, depend on K. Thus, all implicit constants in asymptotic 
notation, such as 0(), o() and <^, are allowed to depend on the coefficients of the linear 
map \E', the dimensions d and t of the domain and the target space of \I', and on the 
discriminants of the forms fi, . . . , ft. 



Methods and results. The nilpotent Hardy-Littlewood method comprises a scheme 
that allows to obtain for any given arithmetic function h with sufficiently quasi-random 
behaviour an asymptotic for the expression 

t 

where K C M'^ is convex and satisfies ipi G [1, N] for each i G {1, . . . , if:}. We proceed to 
describe the basic set-up for the method. There are two main requirements on h. One 



is that h has small Gowers-uniformity norms (see Section 10) and the other is that one 
can find a majorant //:{!,..., A^} — )■ ]R>o such that 

1. {majorant property) the point-wise estimate h{n) ^ Cv{n) is satisfied for an 
absolute constant C independent of A^, 

2. [density condition) h has positive relative density in v in the sense that 

^n^Nhin) ~ C"E„<gAr 

3. (pseudorandomness) z/ is a pseudorandom measure. 

A pseudorandom measure resembles a true probability measure. Apart from requiring 
its total mass to be approximately 1, that is, Kn^Nuln) = l + o(l), there are two further 
defining conditions for a pseudorandom measure: the linear forms condition and the 
correlation condition. Each of them places some independence requirements upon u. 
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The linear forms condition for instance requires 

t 

i=l 

to hold for certain systems of affine linear forms. Once in possession of such a pseu- 
dorandom majorant, a number of tools are available. We will describe them as we 
encounter them. 

Regarding the first condition on h, which was the smallness of certain Gowers- 
uniformity norms, there is an explicit (and quite strong) necessary condition that has 
to be satisfied: h must be equidistributed in residue classes to small moduli. The res- 
ults quoted at the beginning of this introduction show that neither the characteristic 
function ls{n), nor its weighted version r(n) meet this requirement. In such a situation, 
it may be possible to decompose the function h into a sum of functions that are more 
uniformly behaved and consider each of these functions separately. This decomposition 
is known as VT-trick and will be carried out in Section [71 

In Section |2] we construct a majorant for the representation function attached to a 
primitive form /. This majorant will be slightly modified in accordance to the VT-trick 
in Section [7| In Section [9] we check that our majorant is indeed pseudorandom. 

In the course of the minor arc analysis, which starts in Section [TT| we observe that 
polynomial subsequences of 5-equidistributed linear nilsequences are still reasonably 



equidistributed. See Proposition |15.3| below. This result will be deduced from the 
quantitative equidistribution theory Green and Tao worked out in ^U\. In connection 
with their factorisation theorem [TOl Thm 1.19], it could prove a useful tool for the 
minor arc analysis of a wider range of arithmetic problems. 

Due to the quite complex foundations of the Green- Tao methods it proved not feasible 
to provide a self-contained account of it here. This paper therefore strongly depends on 
[8]. It furthermore relies on results about the divisor function from [18], which will be 
used in the construction of the pseudorandom majorants. 

Results. In p,8j a pseudorandom majorant for the normalised divisor function f(n) = 
(log A^)~^ Yld\n ^ been constructed. Here we shall combine this majorant with a siev- 
ing majorant to obtain a pseudorandom majorant for the function Rf{n) which counts 
the number of representations of n by a primitive positive definite binary quadratic 
form; results for the non-primitive case are immediate corollaries. 

With this majorant at hand, we obtain, employing the machinery from [HI [H] in com- 
bination with the inverse theorem for the Gowers-uniformity norms [TT] , an asymptotic 
for the representation function Rf evaluated along systems of linear equations: 

Theorem 1.1. Let fi, . . . ft be primitive positive definite binary quadratic forms. Let 
\I' = {ipi, ■ ■ ■ ,ipt) : Z"^ — )■ Z* 6e a system of affine linear forms such that no two forms 
ipi and ipj are affinely dependent. Suppose that the coefficients of the linear part \E' are 
bounded and that K cM.'^ is a convex body such that "^IK) C [0, A^]*. Then 

J2 RfAMn))--.RfMin)) = PoollPp + oiN'') , 

where 
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and 

Pp ~ ^^^^'^^i^/p'^^Y 11 ' 

with Pf^Ail) denoting the local number of representations of A (modg) by f, that is, 
Pf,A{q) ■■= \{{x,y) G [g]^ : f{x,y) =A (modg)}| . 

Theorem 1 1 . 1 1 extends previous results by Heath-Brown [12] and improvements thereof 
by Browning and de la Breteche P, where the case of sums of two squares, fi{x,y) = 

+ ?/^, for i = 1,...,4, together with systems : — >■ Z"^ was considered. We 
emphasise, however, that, in contrast to the results from [12] and [1], we unfortunately 
do not obtain explicit error terms in our asymptotic. 

The most interesting case of correlation along a system of affine linear forms is cer- 
tainly the 'infinite complexity' case of 

En^NRf{n + ai) . . . Rf{n + ad) , 

corresponding to the prime-tuples problem. In this case the linear forms involved are 
not independent and thus an asymptotic would give very strong information on the 
regularity of distribution of the function involved. Results of this type lie out of reach 
of the Green- Tao Hardy-Littlewood method. 

It is worth mentioning at this point a recent related result of Henriot [13], which 
provides a correct order upper bound for Kn^NF{\Qi{n)\, . . . , \Qt{n)\), where F : N* — )■ 
M^o belongs to a family of functions that does include F{ni, . . . ,nt) = Yll=i Rfii^i), 
and where the Qi are coprime irreducible polynomials. The bounds in this result are 
independent of the discriminant of the polynomial Qi . . . Qt- 



Theorem 1.1 has some natural arithmetic consequences. Analysing the frequency 
of 4-term arithmetic progressions in sums of two squares (weighted by the represent- 
ation function) may be viewed as a special case of studying the (average) number of 
simultaneous zeros of a pair of diagonal quadratic equations, namely solutions to 

I ™2 O-r^ O'T^ _L J- — n 

j> -j^ I .-^2 3 ^ I I Q — ^ 

t2 _|_ ™2 9'T>2 9t2 _l t»2 _i_ ™2 — ri 

I <^ ^ Zjjjg I tij Y I — ^ • 

While the respective system for 3-term progressions may easily be handled by the circle 
method, Heath-Brown mentions in p!2l that in order to give an asymptotic for the 
number of 4-term arithmetic progressions in sums of two squares "it would appear 
that one would require a version of the 'Kloosterman refinement' for a double integral". 
Browning and Munshi ^ very recently succeeded in showing that the circle method can 
in fact be employed to study the above system, and, moreover, any pair of quadratic 
equations in ^ 9 variables that takes the form 

Fi(x3, . . . , X„) = -C{xl + xl), F2(X3, ...,Xn) = . 

Previously, the classical Hardy-Littlewood method had been successfully apphed to pairs 
of diagonal quadratic equations in at least 9 variables: 

Theorem 1.2 (Cook [3J). Let F, G : — )■ Z be integral diagonal quadratic forms such 
that for all real A, fi, not both zero, XF + fiG is an indefinite form in at least 5 variables. 
Then there is some positive constant Kq such that the number of simultaneous integral 
zeros of F and G in the box 

P^Xi^CP, z = l,...,9. 
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is given by 

Af{P) = KoP^ + o{P^) as P ^ oo . 

Our result, which is in fact an analogue of [8, Thm. 1.8], considers certain highly 
singular systems of quadratic equations in 8 or more variables. 

Theorem 1.3. Let t ^ A and let fi,...,ft be primitive positive definite binary quadratic 
forms. For an integer s ^ t — 2, let A & MsxtC^) be a full rank matrix whose row-span 
over Q contains no non-trivial element with less than 3 non-zero entries. 
Define a height function H : Z^* — t- ]R>o by 

H{x) = ^.^max^^ fj{x2j-i,X2j) ■ 

Then the simultaneous zeros of the system of quadratic forms 

t 

Fi{xi, . . . ,X2t) = '^aijfj{x2j-i,X2j) , z G , 
i=i 

satisfies the following asymptotic: 

\{x e 1}' : H{x) < iV, = • • • = F,{x) = 0}| = — =|^L=«^ + o(iV 

y/\Di...Dt\ p 

where 

\{x e (Z/p"Z)2* : Fi(x) = --- = FJx) = (modp™)}! 
ttp := lim — — 

and 

a^:=\{zE{l,...,N^y:Az = 0}\ . 



2t\ 
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We conclude this introduction with the fairly short deduction of Theorem 1.3 



Proof of Theorem L3_ from Theorem \1.1\ The number of simultaneous zeros of bounded 
height of the forms Fi, . . . ,Ft can be reinterpreted in terms of representation functions: 



t 



\{x e 1?' : = ■ ■ ■ = F,{x) = 0,H{x) ^ N}\ = H^/.^^^) " (l'^) 

2e[Af2]*:A2=0i=l 

To turn the latter expression into the form of a linear correlation, we may follow [8|, §4]: 
Pick a basis for the integer lattice 

T := {z eZ^ : Az = 0} . 

Since A has full rank, F has rank d := t — s, and thus there are linear forms ipi, . . . , : 
Z"' ^ Z such that 

r = {(^i(n),...,^tH):nGZ'^}. 

This system of forms has finite complexity, as otherwise we would find i ^ j such that 
aiijji = djipj for some non-zero integers ai,aj. Hence, 

r = {z E : Az = 0, aiZi — ajZj = 0} , 

which implies by the full rank assumption on A that the row-space of A contains a 
non-trivial element with less than 3 non-zero entries, a contradiction. 
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Thus, (1.1) takes a form to which Theorem 1 1 . 1 1 apphes and we obtain: 

|{x G Z^* : = ■■■ = Fs{x) = 0,H{x) ^ N}\ 

t 

yl-LJi ■ ■ ■ L>t 



V 



Note that 



= \{ze{l,... N^Y ■Az = {]}\+ o{N^'') , 

which justifies to define Ooo := \{z G {1, . . . A^^}* : Az = It remains to interpret the 
local factors fip in terms of Fi, . . . , Fj. If m is sufficiently large, the Z-basis {ipj)j&[d\ of 
r gives rise to a basis of {z G {'L/p"^'LY : Az = (modp™)}, whence 

t 

= P-^'^'^'^ n G [p'"]^ : = ^,(a) (modp-)}| 

ae(Z/p'"Z)<^ J=l 

_ |{a; G (Z/p™Z)2* : = ■■■ = = (modp'")}] 

which yields /3p = Op for all primes p. □ 

2. A MAJORANT FOR THE REPRESENTATION FUNCTION VIA THE KRONECKER SUM 

Preliminaries and notation. Recall that a binary quadratic form f{x,y) = ax"^ + 
hxy + cy^ is primitive when (a,6, c) = 1 and that its discriminant is given by D{f) = 
— 4ac. Throughout this paper all binary quadratic forms will be assumed to be 
positive definite. The number of ways a form / represents an integer n is described by 
the representation function i?/ : Z — )• Z, defined by 

Rf{n) ■= \{{x,y) ■ f{x,y) = n}\ . 

In order to make use of some multiplicative properties of Rf, we introduce the function 
rf : Z ^ Z, defined by 

rf{n) := Rf{n)/k{D) , 

where k{D) denotes the number of automorphs of binary forms of discriminant D. We 
have k{D) = 6, 4, 2 according to D = —3, D = —4 or D < —4, respectively. 

Closely related to r/ is the function : Z — Z which counts — up to the factor 

k{D) — the number of ways n is represented by any equivalence class of forms of dis- 
criminant D = D{f). We define r£)(f) by 

'r'D(f){n) := ^Z'^"") ' 

D{n=D{f) 
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where /' runs through a complete system of representatives of primitive forms of dis- 
criminant equal to D{f). 

The function majorises Vf and has some properties that suggests it may be a 
good candidate to start the construction of a pseudorandom majorant with: on the one 
hand, the number h{D) of equivalence classes of primitive forms of discriminant D is 
finite, and thus the average order of rD{f) is comparable to the average order of r/; on 
the other hand, Ti:){/) has an arithmetic representation as a divisor sum, a structure that 
proved to be very suited for the construction of a pseudorandom majorant in both [6] 
and Haj. 

Let / be a primitive positive definite form of discriminant D. Then rj(n) = for 
n < and rj(0) = 1. For positive integers n coprime to D, td has the representation 



d\n ^ ^ 



as a character sum, where the symbol is a Kronecker symbol. For general rz, we pick up 
another factor which depends only on gcd(n, D) and the parities of the a in HpiDp'^lln^'"- 
We will see in Corollary |5.1 that 

rDU){n)<^DY.{^) (2-1) 



d\n 



holds for all n G N. 

Recall that the Kronecker symbol is only non-zero when its entries are coprime and 
that furthermore the following lemma holds; see for instance |H Thm. 1.14]. 

Lemma 2.1. If D = 0,1 (mod 4) is a non-zero integer, then there is a unique character 
Xd '■ (Ij/DZi)* —7- {—1,1} such that Xd{p (modD)) = for odd p coprime to D. 

Let Qr, denote the set of primes for which Xd{p) = Note that this is the union 
of the primes in a collection of progressions modulo D. By multiplicativity we have 



win ^ ^ Win nOlIn nQllnnCOr^ 

(2.2) 



d\n d\n p'^\\n P°||",PGSd 

where 



rD{m)= Y[ (a + 1). 

P°'\\m,XD{p)=^ 

We denote by Vd the set of primes for which Xd{p) = 1- Thus, a square-free number 
n is represented by some form of discriminant D{f) only if all of its prime factors belong 
to Vd or divide D{f). 

We can say a little more about the sets Vd and Qd'- Since x is a non-principal 
character taking values ±1, the fact that J2a&(z/DZ)* ~ ^ implies that both Vd and 

Qd are the union of the primes in exactly progressions modulo D. Thus, the square- 
free numbers that are coprime to D and representable by some form of discriminant D 
are those numbers whose prime factors belong to a set comprising asymptotically half 
the prime numbers. 
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As a last piece of notation, given any set V of primes, let (P) denote the set of natural 
numbers all of whose prime factors belong to V. Thus we may write 

'Tnin) = ^ ld\n ■ 

Construction of the majorant. The key observation for the construction of our 



majorant for rj is that according to (2.1) and (2.2) it suffices to find two majorants 
separately: one for a divisor-type function related to td, and one for the characteristic 
function of numbers without Q/5-prime factors. Writing V}j = Vd U {p : p\D}, the 
characteristic function of interest is The shifts by square factors of the form 

HpeSD infiuence the asymptotic density by a constant factor and may be taken 

care of separately. If i/ is a majorant for and if /3 is a majorant for l(pjj), then rf{n) 
is majorised by 

Oz)(l)z/(n) ^ (3{n/m^)lra2\n ■ 

The majorant (3 for l(-p*) will be chosen as a sieving majorant. In fact, the approach 
via sieve weights in [8] proves universal enough to apply here too without much change. 
Concerning z/, we make use of the results on the divisor function from [T8] . 

Since neither td nor has asymptotic density, we proceed to determine the av- 

erage order of td and show that E„^Arr£)(n) x (log NY^'^. This suggests to renormalise 
the factors in the bound on as follows 

(1^^ S l(P,)(n/m^)(logiV)V^ 

Iwaniec [I5] proves via sieve theory that it is indeed the case that l(pj^) is of average 
order (logA^)"^/^. This bound, however, is not needed here. 

Lemma 2.2. td satisfies the asymptotic bounds 

En^NTDin) X (log AT) ^ 

where the implicit constants may depend on D. 
Proof. We have 

1 IN] 1 , 



N 



To estimate the last sum, observe that on the one hand 

d^N P^N 

holds, where the last step follows from the prime number theorem in arithmetic pro- 
gressions in the form 

E P~' = ^loglog(iV) + 0(l) . 

p=a (mod q) 
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The above remains true when replacing Vd by (Q^) := Qd ^ {p '■ p\D} and 0(1) by 
Od(1). On the other hand the following chain of inequalities allows us to deduce a 
matching lower bound 




\ogN + 0{l) = \^-^[ >^ —II >^ — |«| — l(logiV) 



1/2 



□ 



The divisor-type majorant. To start with, we recall the divisor function majorant 
that was constructed in ^18j based on Erdos's work For any 7 > define the 
truncated divisor function : [A^] — > Z by 

and the truncated restricted divisor function t^,^^ : [A^] — )■ Z by 

d^N-t 

Proposition 2.3 ([IH], Majorant for the divisor function). Let C, = 2^™ for some 
m G N. Write Xq for the exceptional set of all n ^ N satisfying either of the following 

(1) n is excessively "rough" in the sense that it is divisible by some prime power p"", 

2, with p°- > log*^^ A^, or 

(2) n is excessively "smooth" in the sense that if n = HpP" then 

JJ^ p"" ^ iv?/i°gi°g^ 

Further, define U{i,2/^) := {1} for i = log2(2/0 - 2, and U{i,2/^) := else. If 
s > 2/^, write U{i,s) for the set of all products ofmo{i,s) := \^s{i + 3 — log2s)/100] 
distinct primes from the interval [iV-^/^*''^\ A^^/^']. Define : [A^] — )■ IR+ by 

(log log Af)^ 6 log log log N 

h{'^)-=Y 5Z 2nu\nT^{n) + Inexor(n) . 

s^2/g i^log2S-2 uet/(i,s) 

Then T{n) < ^(^{n) for all n < N. 

Note that the main term of z/g has low complexity in that it only involves small 
divisors since all u ^ A^^. Restricting all occurrences of divisor functions in z/^ to only 
count divisors in (Vd), yields a majorant for To of the same order of magnitude as 
To- We make one further modification and replace the cut-off in the definition of to^^ 
by a smooth cut-off of the form which appears in Green and Tao's A-majorant. This 
turns out to be advantageous when establishing the linear forms condition. Thus, let 
X : M — )■ be a smooth, even function that is supported on [—1, 1] and satisfies the 
properties xi^) = 1 for a; G [—1/2, 1/2] and dx = 1. Define t^^^ : [N] ^ Z by 



d^N^ 
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Then ^ '''h-yi''^) ^ ''~D,-y{n) holds. With this definition we have the following 

lemma. 



Lemma 2.4 (A majorant for td). Let the sets U{i,s) be those which Proposition 2.3 
produces for ^ = ■j/2. Let ud^^ : [A^] —> M. be defined by 

^ / (log log A^)^ 6 log log log N ^ 

CuD,^{n) := —==i J2 Y '^'Un^*D,y{n) + In^XoToin) 

^ V s^4/7 i>log2S-2 u£U{i,s) 

Then TD{n)/ {log Ny/"^ ^ CuD,-y{n) for alln G [A^] and there is some constant C bounded 
independently of N such that ¥.n^Mi'D,'y{n) = 1 + o(l). 

Proof. We begin to check the majorisation property. For any n G [A^], write n = n-pm 
where n-p is the largest factor of n that belongs to (Vd)- Then 

TDin) = r{nv) ^ z>^/2(np) ^ C(log Ar)i/VB,^(np) = Cilog N^^^udM , 

as required. The existence of C follows as in the proof of [18, Prop. 4.2], taking into 
account that IEm^ArTD,7/2("^) ^ (logA^)^^^, which is proved in much the same way as 
Lemma Ell " ' □ 

The sieving type majorant. The next task is to give a majorant /3 : N — for the 

characteristic function of the set {V}j) of numbers without Q^-prime factors. Adapting 
the Selberg-Sieve majorant for primes from [8] to the set l(-p|,), we aim to remove all 
integers that have a prime factors p from with p ^ A^'''. Let x : M — )• M be a smooth, 
even function that is supported on [—1, 1] and satisfies the properties x(0) = 1 and 
lo dx = 1. Define in analogy to [8^ App. D] 



Pin) := AD,x,,in) ■= C'(log Ar)V2 j ^ ^^^^ ^ ^ 



d\n,de{QD) 



logd 

log N'y^ 



for some constant C. The results from [8] show that C may be chosen such that 
En^NPin) = 1 + o(l). This will play a role in Section|9| Note that /3(m) = C"(log A^)^/^ 
at every Q/) -prime-free integer m ^ N, and thus we have the pointwise majorisation 

l(PM(n)(logA^)^/2 ^ C'-'Pin), n e [N] . 



3. A REDUCTION OF THE MAIN THEOREM 

While it is possible to apply the nilpotent Hardy-Littlewood method to the repres- 
entation function Vf itself, it is the aim of this section to show that we can deduce 
the main theorem from a similar statement about a smoothed version of r/, that is, 
a function that agrees with Vf everywhere except on a sparse set where the restricted 
divisor function T£, shows exceptionally irregular behaviour. 

First note that the pointwise bound rj(n) ^ rij(^f^{n) ^ r(n) for n G N of the 
representation function of any primitive positive definite quadratic form / by the divisor 
function gives the following second moment estimate. 

Lemma 3.1 (Second moment estimate). Let fi,...,ft be primitive positive definite 
binary quadratic forms and let = [ipi, . . . ,1^1) : Z"^ 7} be a system of affine-linear 
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forms whose linear coefficients are bounded by L. If K G M'^ is a convex body such that 
C [0,NY, then 

Proof. Let K' := {x E K : G [1, A^]*}. Then Holder's inequality yields 

i&[t\ ie[t] 
ie[t] 

The remaining steps are standard; cf. the proof of [HI Lemma 3.1] for details. □ 

The next lemma, which is a combination of some technical lemmas from [5] , describes 
an exceptional set for the divisor function, i.e. a sparse set containing those numbers on 
which the divisor function behaves particularly irregularly. 

Lemma 3.2. Let Ci > 1 be a parameter and write Xq for the set of all positive n ^ N 
satisfying either of the following 

(1) n is excessively "rough" in the sense that it is divisible by some prime power p"", 
a ^2, with p°- > hg'^^ N, or 

(2) n is excessively "smooth" in the sense that if n = YlpP"^ then 

Yl ^ jY7/loglogJV^ 

(3) n has a large square divisor m'^\n, m > N'^ . 
Then 

t 

5^^(n)eXo«log-''^/'iV. 
1=1 

Proof. See [3] for the original results or [TSl §3] for their adaptation to this situation. □ 

The previous two lemmas allow us to deduce the main theorem from an equivalent 
statement about smoothed versions of the representation functions rj^. The particular 
smoothed functions we shall work with will be chosen in Section [71 

Lemma 3.3. Let fi, ■ ■ ■ , ft be primitive positive definite binary quadratic forms. For 
each i G [t], let ff. : {0, . . . , N} — )■ M denote a function that agrees with Vf^ on [N] \ Xq, 
that is, outside the exceptional set of the divisor function, and which further satisfies 
^ rf.{n) ^ Tf^in) for all n G Xq U {0}. // the parameter Ci of the exceptional set is 
sufficiently large, then the main theorem holds if and only if under the same conditions 

ff,{^,{n))...fjXMn))=P^\[^ + o{N'') . 

Proof. This follows by the Cauchy-Schwarz inequality from the previous two lemmas 
and the bound 

t 

□ 
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The above lemma in particular shows that a pseudorandom majorant used in a proof 
only needs to majorise the function rj (or ff) on the set of positive unexceptional 
integers. We can therefore truncate the summation over dilates of m^, m G (Qd) in the 
majorant to those m with m < N''. Furthermore, we may restrict attention to the case 
where ^(/T) C [1, AT]* 

4. Distribution in residue classes 

The transference principle from [HI E], which we shall employ later, only works with 
functions h that are sufficiently quasirandom in the sense that all f/'^-norms — E/i||[/fe 
up to some order k, determined by the specific system \Ef one is working with, are small. 
A necessary condition for the uniformity norms to be small, is that the function h at 
hand is equidistributed in residue classes to small moduli. This condition is in fact 
equivalent to requiring that h does not correlate with periodic nilsequences of short 



period, cf. Section 11 



As seen at the start of the introduction, the representation function rj does not have 
this property. To remove these obstructions to uniformity, one can try to split the 
function into a sum of functions each of which does not detect a difference between 
residue classes to small moduli. This strategy is known as ly-trick. In order find a 
suitable decomposition, we shall investigate the quantities 

IE„<Arl„=^ (modg) rf{n) 

for fixed period q and fixed residue class /3. Define 

PiAq) ■= lii^^y) e k? ■ f{x,y) = (3 (modg)}| 

to be the number of representations of /3 (modg), and let K{N) = /~^([0,A^]) C M^. 
This is a the area enclosed by the ellipse f{x,y) = N and hence a convex set of volume 

A volume packing argument, cf. [Bl App. A], yields 

^ rf{n)kiD)= Yl l = ^4^vol(i^) + 0(v^g) , 

n<:N {x,y)<^K{N)r\I? ^ 

n=/3(modg) /(a:,i/)=/3 (modq) 

which proves the following lemma. 

Lemma 4.1. Let P := {n ^ N : n = 13 (modg)} be an arithmetic progression. Then 
the average of the representation function of f along P satisfies 

In view of this lemma it is not surprising that we will make use of several further 
observations on the densities PfAq)q~^, which will be established in Section [6] 

Lemma 4.2. Pf,/3{q) only depends on the genus class of f . 

Proof. Two forms /i and /2 belong to the same genus if and only if they are locally equi- 
valent in the following sense: for every non-zero integer m there exists a^n G Gl2{'^/rnL) 
such that 

h{.x, y) = /2((a;, y)am) mod m . 
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Thus, Pfi,i3{q) = P/2 foi' ^-ll positive integers q and all /3 G [g]. □ 

The reason this lemma is important to us is that it allows us to consider instead of Vf 
the following more regularly behaved function in all questions regarding the distribution 
in residue classes. Let the genus class representation function rg{n) : N — )■ N be defined 

by 

rg{n) = E{rf{n)\f G g) , 
where / runs through a system of representatives of classes in the genus g. Under the 



assumptions of Lemma |4.1| we then have 

E„6pr^H = E„epr,(/)(n) + 0{\P\''/'q') , [4.1] 
where g{f) denotes the genus that contains /. 

5. Results from the theory of binary quadratic forms 



The aim of this section is to prove the bound (2.1) on the number of representations 
of a positive integer n by a form of discriminant D, which was used to construct the 
majorant function in Section [2j 

5.1. Representation by primitive forms of fixed discriminant. The question of 
whether or not m is properly representable by a primitive form of discriminant D is 
linked to the solubility in x of the congruence 

^ D (mod 4m) ; (5.1) 

see p. 506] or pU p. 172]. If / is a form of discriminant D that represents m properly, 
then / is equivalent to (m, n, *) = mX"^ + nXY + where D = — Amk for some 
integer k. 



Claim 1. Consider the solutions x = n to (5.1) that satisfy < n ^ 2m. These 
form a complete set of incongruent solutions modulo 2m. Those solutions among them 
for which {m,n, [D — n^)/4m) is primitive are in one-to-one correspondence with the 
distinct classes of primitive forms that represent m properly. 

Proof. Let / be a primitive form and suppose there are coprime u and v such that 
f{u,v) = m. Choose a solution (2:0,^0) to 1 = uzq — vwq. Then 

/'(X, Y) := /((X, r) f ^ )) = mX' + nXY+ ^^^Y' 
' ^ ' ^ ' ' ^ \ Wo zo J ^ 4m 

is an equivalent form with leading coefficient m. Choosing different solutions w = Wq-\-w' 
and z = zq -\- z' to 1 = uz — vw, we have w' = tu and z' = tv for some non-zero integer 
t, which implies that the middle coefficient n is unique modulo 2m. In particular 
(m,ni,*) ~ (m, ^2,*) if and only if ni = n2 (mod 2m). 

Observe that in the other direction every solution x = n to x^ = D (mod 4m) yields 
an equivalence class (m, n, *) of forms of discriminant D that represents m properly. □ 

In order to determine the number of classes of forms that represent m, we are inter- 
ested in two pieces of information: 

1. the number of solutions x to x^ = D (mod 4m), and 

2. how many of these solutions yield primitive forms (m, x, *) of discriminant D. 
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A third necessary piece of information regards the number of proper representations 
by a fixed class of forms: any two proper representations of m by a fixed form / are 
related by an automorph. Thus each class C{f) of forms equivalent to / represents m 
properly in k{D) different ways, where k{D) is the number of automorphs of forms of 
discriminant D. 



In order to analyse the the number of solutions to (5.1), we introduce the related 
irreducible quadratic polynomial P{x) = — D, which has discriminant AD. 

Let p{a) := \{k G [a] : P{k) = (moda)}| denote the number of zeros modulo a. The 
counting function p is multiplicative by the Chinese remainder theorem, which leaves 
us to determine p at prime powers. If p \ D, then (cf. p3l Thm 12.3.4]) 

( 2 ifj9 = 2,a = 2 

p(p")= <^ 2(1 + xd(p)) ifp = 2,a>2 
[ 1 + xoip) ifp>2 . 

In the remaining case of primes p\D, Hensel's lemma implies that 

p{p») = p(p^f(4D)+i) if ^ ^ Vp{4D) . 

For p\D we will show below that, in fact, there are no primitive forms that properly 
represent an integer m with Vp{m) > Vp{D) for some prime p. 

If m is coprime to D, then each solution to = D (mod 4m) yields a primitive form, 
and p(4m) is directly linked to the number r|)(m) of classes of primitive forms that 
represent m properly: r}j{m) = |p(4m). 

We turn to the case where gcd{D, m) > 1. If there is a prime p dividing gcd(Z), 4m) 
to an odd power, then solutions to D = n"^ — Amk yield primitive forms if and only if 
each such p divides both D and 4m to the same power. 



Considering the set of forms arising from solutions to (5.1 ), we can, if gcd(m, AD) > 1, 



retrieve the number of primitive forms among them via an inclusion-exclusion argument. 
Indeed, when d = gcd(m, n, k), then m/ d is properly represented by the form ^, |) of 
discriminant Dd~^. Note that automorphs of forms of the first kind are also automorphs 
of forms of the second kind and vice versa. 
Let p"||4m and suppose that p'^||-D, a > 1. 

We begin by analysing the largest range for a, a > cr > 1. When a is odd, then there 
are, as seen above, no primitive forms that represent p"' properly. Suppose next that a 
is even and define 

p'{p") ■= \{x ■.x^ = D (modp")}| - |{a; : x^ = Dp'^ (modj9°-^)}| . 

This quantity counts the number of solutions to x^ = D (modp") for which = D+kp" 
for some k not divisible by p. The expression for p'{p'^) simplifies to 

p'{p^) = \{x:x^ = Dp-'' (modj9"~")}| - \{x : x^ = Dp'"" (modp"-^-^"-^))}! , 

which is seen to be by Hensel's lemma (note that p \ Dp^^). Thus, no power p°' \ D 
of a discriminant-prime with p'^\D is properly representable by a primitive form. 

What remains are even powers p"|-D, a < a and the case In the former case, 

any solution to D = n'^ + Amk with p"/^||n satisfies p \ k. Hence there are p°/^(l — p^^) 
choices for n (modp°). In the latter case, p \ k holds if and only if pr"/2l hence there 
are pLa/2j choices in this case. 
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In total, the number r*(m) of primitive forms properly representing m is given by 

^(l + l2tD) n (^ + Xd{p)) n (?"^'(l-?~')Wlaeven + gL°/2Jl^^^) , 

p\m,p\D q\D 
q" ||4m, 

where p and q run over primes, and where the factor 1/2 takes account of the fact that 
for every solution x G [4m], x + 2m is the unique other solution determining the same 
class of forms. 

Collecting everything together, we obtain the following explicit expression for vd: 

Corollary 5.1. The total number of representations (proper and improper ones) of an 
integer m by classes of primitive forms of discriminant D satisfies 

52 |m pfD q\{D,m) ^ ^ 

((5,D)=lp"||m<5-2 g"||4m 

q'^\\4:D 

d\m 

where p, q run over primes. 

5.2. Representation by genera. Recall that the representation function : N — N 
of a genus class g was defined to be rg{n) = E(r//(n) | /' G (7), where /' runs through a 



system of representatives. This function is of interest since by Lemma 4.1 and Lemma 



4.2, it has the same distribution in residue classes as any function rj with f ^ g. We 
aim to reduce the problem of determining the number of representations of an integer 
n by a specific genus class to that of counting certain representations of the factor n' of 
n that is coprime to D. 

This is advantageous for the following reason. The values in (Z/DZ)* that are rep- 
resented by a form / with D{f) = D form a coset of the subgroup in (Z/DZ)* that is 
generated by the values the principal form represents, c.f. [U Lemma 2.24]. Thus, differ- 
ent genera represent disjoint sets of values in (Z/DZ)*. This means that the character 
sum expression of the function r£,(^f) which counts representations of all classes in h{D) 
yields an arithmetic expression for the function Vg which just considers those classes of 
genus g. Indeed, let TZg denote the non-zero residues modulo D that are represented by 
forms in g. Then for n' coprime to D we have 

rg{n') = X] ln'=6(modD) ExD(rf) ■ 

beUg d\n' 

For an arbitrary positive integer n, let n = njyfi'^n' be the factorisation for which n' 
is coprime to D and is the largest divisor n£)\{n, D) such that ^^7^ = is a square. 



This factorisation is chosen in such a way that Corollary 5.1 implies roin) = r£,(n'n£)) 
which is of interest because in n'riD the factor that is not coprime to D is bounded. 

Let {n'nD,b,c) be a primitive form properly representing n'riD. Then, since n^l-D, 
we have {nu,b) > 1 and hence (n£),c) = 1 by primitiveness of the form. Since further 
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{rio^n') = 1, we have 

{n'nD,b,c) ~ {c,-b,nDn') 

~ (c, —b, nnn) * {no, —b, n'c) * {n'c, —b, rin) 

~ {cnr,, —b, n) * {n'c, —b,nD) 

~ (n', b, ctid) * {riD, b, n'c) . 

Note that all forms involved are primitive. 

Thus, we can decompose the representation into separate ones for the coprime factors 
n' and hd. We aim to use this multiplicative property of representation by primitive 
forms of fixed discriminant in conjunction with the following lemma. 

Lemma 5.2. The principle genus Qq is a subgroup of the class group (a finite and 
Abelian group). The genera form cosets of Qo the class group. 

Proof See e.g. [211 P-197, Thm 2.8]. □ 

With the help of this lemma we have 

rg{n) = \g\^rg^g/-i{nD)rg'{n') . 
g' 

If the residue n' {modD) is representable by a form of discriminant D, then let gn' 
denote the unique genus class that represents n' (modD). We may use the arithmetic 
representation of r^, ^ to obtain the following lemma. 

Lemma 5.3. Given n = nDn'n"^ as above and a genus class g, then 

rg{n) = rg,g J Myxoid) . 

d\n' 

6. REPRESENTATION IN Z/qZ 

This section contains several results on the densities Pfi,i3{p"')p~"', which will be es- 
tablished using results from the previous section and the following proposition. 

Proposition 6.1. Let P = {qom + /3o : m ^ M} be a progression such that D\qQ and 
/3o ^ (modp") for any p"||go- Then 

E^^pJ^XDid) = Cllil- XDip)p-') Yl + O(^), 

d\n p\qo a^O 

where C = (1 + xd) = 0(1). 

We defer the proof to the end of the section. The following lemma is a rather imme- 
diate consequence. 

Lemma 6.2. Let q be a positive integer that is divisible by D and let P G [q] be such 
that /3 ^ mod for any p"\\q. Then 



Q 

^ p\q a^O 



where C = r -i(/3,,)(l + xnij^mD) = 0(1). 
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Proof. By Lemma 4.1 and Lemma 5.3 we have for P(M) = {m = j3 (modg) : m ^ M} 
PfA^) 27r _ E„eP(M)r/(n) = lim EneP{M)rg(f){n) 



j*g-'iM lim E„6P(A/) VxD(rf) • 

d\n 



r 

g* 



By Proposition 6.1 and the class number formula the result follows. □ 

With the help of the previous lemma and a result of Steward |22|, we obtain the 
following more exphcit information on the densities 

Lemma 6.3. (a) Let po be a prime that divides D and suppose that /3 ^ (modpg). 
Then 

P/,/3(Po)Po" = 0(1) 

as (3 and a vary. If a ^ Vp^i^D) and (3^0 (modpg), then 

P/,/3(Po)Po" = P/,/3+fcp?(Pr^)Po ^"^^^ 

for any k G Z/po^ 

(b) If pq\ D then we have for (3^0 (modpg) 

P/,/3(Po)Po " = (1 - Xd{Po)Po^) Y1 lpJ|/3^^(^o) • 

(c) Let p be any prime. Then 

Pffi{p'')p-" « « 

holds. 

Proof, (a) We may assume a > Vp^{D). Let /3o G {l^jDl^* be a residue representable 
by / and let f3\ be such that (3i = (3 (modpg) and (3i = (3q (modp^''*^'^^) for any prime 
divisor p ^ pq oi D. By choice of (3q we have p f ,i3^{p"^'^^^) ^ 1 for p 7^ po- The previous 
lemma yields 

whence the first part of (a) follows by multiphcativity of p. Define /32 G Z be such that 
(32 = (3o (modp^*"*^^^) for any prime divisor p ^ po of D and (32 = (3 + kp^ (modpo"*"^)- 
Then, by Lemma [6. 2 [ 



PfMPo^^) _ Pf,nkp^iPo^^) 



p\D,p=/=po 



pVp{D) 



a+1 

Po Po 



The proof of part (b) is almost identical. Let /3o € {Z/DZ)* be a residue representable 
by / and let (3i be such that (3i = (3 (modpo) and (3i = (3q (modp'^*''^'^^) for any prime 
p\D. Then {(3i,D) = 1 and g{f) is the unique genus class representing /3i (modD). 



LINEAR CORRELATIONS AMONGST NUMBERS WEIGHTED BY Rf 19 

Hence '^g{/)*c,-/(l) > 1; as the principal genus represents 1. Since f3i is representable by 

^1 

f, there is some m such that ^d\mD+/3i Xoid) > 0, hence, in particular xoif^D + = 
Xoifii) = 1- Two applications of Lemma 6. 2| yield 

—a- - - paj^ [~^~ ) ^ 1^ KiwXDip^o) ■ 

Part (c) follows from [22l Corollary 2], which implies, as shown in [2], Lemma 31], 
that any quadratic polynomial P{x) = aix^ + a2X + as of discriminant Dp = a\ — 4aia3 
satisfies 

\{x e Z//-Z : P(x) = (mod/)}| ^ 2p^''(^^)/2 ^ 

Consider for fixed y the polynomial Py{x) = f{x,y) = ax^ + bxy + cy"^ of discriminant 
y^D[f ). There are less than values of y G 'L/p'^'L for which p^\\y. Thus 

fe=0 

□ 

An immediate corollary, which will be essential for the VF-trick, states that the p- 
densities are constant for lifts of non-zero residues /3 (modp") to higher powers of p: 

Corollary 6.4. Letp he a prime and suppose that a ^ Vp{D) and that /3 ^ (modp"). 
Then 

for all k G Z/pZ. 



Proof. This follows from part (a) and (b) of Lemma 6.3 □ 



Proof of Proposition \6.1\ Multiplicativity and the assumption on /3o yield 

d\n ^ p\qo o^O ^ d\qm+l3 

where (g, (3) = 1, and q and go have the same prime divisors. We will estimate the 
mean value of '^a\qm+i3XD{d) by the hyperbola method. Recall that xd is a character 
to the modulus HpIdP (^f- [13 Ch.9.3]) and let xh the character to the modulus 
ripi^P that is induced by xd- (Note that q is divisible by HpIdPO Thus xhi^) is only 
non-zero when n is coprime to q. Then 



d\qm+l3 \d\{qm+l3) d\(qm+p) / 

d^T d>T 



where the cut-off T will be chosen as T = \fqM. We begin with the large divisors. 
Writing G := (Z/gZ)* and denoting its dual group by G, we have 

d\{qm+l3) n^Mq I I d\n 

d>T d>T 
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Since ^ gM, this equals 



M|G| 



j^gG m^T T<d^Mq/m 



The character is a non-principal character to the modulus q unless x is the character 
X*D induced by xd- We consider the cases x — X*d x¥' X*d separately. For x X*d: 
we have 



M\G\ 



If X = Xd> we have 

^ \x^(/3)E^^M E (XdXdM 



M\G\ 



Xi?(^)^7^ E E ^rn\nXD{m) 
{n,q)=l 



(m,g)=l 

^^^^^^^ E Xi^M ^^'^^' +Oaoggv^) 

(m,g)=l 

-XniP) E E xMm)m + 0(loggyPf). 

{m,W)=l {m,W)=l 

The second term is seen to be small, that is 0{-\Jq/M), by partial summation. The first 
sum, X^m^VP? ^'^nf' ' ^ partial sum of the convergent series 

Y-XdM Y[(^ Xd{p) \t(, , n TT/'i Xd{p) \ 27rh{D) 
E = 11 (,1 - — = 11(^1 - — j ^(^)^ ■ 

Bounding their difference sum by partial summation, we obtain 

Hence, the large divisors satisfy 

E^<M Yl xUd)^XD{fS)l[{l-^^)L{l,XD) + 0{\ogqy^) . 

d|(gm+/3) p\q 
d>T 
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Concerning the small divisors sum, we obtain 



=^(i,xz.)n(i-— )+o(v^)• 



p\q 

Putting things together, we obtain the estimate 

Eo^rnOf Yl XDid) = il + XDmLa,XD)ll{l-^^)+0{\0gqy^) , 

d\qm.+l3 p\q ^ 

which proves the resuh. □ 



7. W^-TRICK 

The aim of this section is to find a decomposition of the function rj into a sum of 
functions that are equidistributed in residue classes to small moduli. 

In the case of primes, see [B] , this was achieved by defining W = Y[p<w{N) P 
product of primes up to w{N), where w : N ^ M is a slowly growing function. For n 
with gcd(n, W) = 1 the von Mangoldt function then splits as 

H^) = ^ Hn)ln=a(raodW) , 

and it suffices to consider the functions n ^ A{Wn + a), a G (Z/VTZ)*, which are 
equidistributed in residue classes to small moduli. 

In the case of the divisor function, the most natural decomposition makes use of the 
restricted divisor function that only counts divisors coprime to W (and is thus likely to 
be a quasirandom function): define 

d:{d,W)=l 

Then 

r(n) = r'(n)^l^|„ , 

w 

where w runs over all integers entirely composed of primes ^ w{N). The second factor, 
J2w '^w\n, is almost periodic. Indeed, let a{p) be such that 

Then any number n that is divisible by some w as above with \w for some p ^ w{N) 
belongs to the exceptional set Xq from Lemma 3^ Choosing 

:= JJ p^'^P'^ , 

one can achieve that the second factor is a periodic function of period W, when adjusting 
the values of r at exceptional integers. This way, it suffices to consider the functions 
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of the form n t— )■ r'iWn + a) for non-zero residues a G \W]- In fact, observing that 
T{Wn + a) = r'iWn + a) X]«;|vF ^w\a unexceptional values of a, we essentially consider 
functions of the form n i— )■ riWn + a). 

In the case of representation functions a very similar ly-trick works. We use the same 
choice of W as in the divisor function case above. 

Definition 7.1. Let A he the set of residues a {modW) such that 

pfAw) > 

and such that a ^ (modp"'-^-') . 

Thus A contains only residue classes that are representable by /, and every n G [A^] 
which fails to satisfy the second condition, that is, for which n = (modp"*^*')) holds. 



belongs to the exceptional set Xq from Lemma 3.2 



Definition 7.2 (Normalised and ly-tricked representation function). Let /3 & A and 

define r'^^^ : [N /W] by 



27T \ W 



Thus, by Lemma 4.1 



7.1. The major arc estimate. Our next aim is to give a major arc estimate for the 
ly-tricked function r'j-f we show that this function has, up to a small error, a constant 
average on arithmetic progressions whose common difference is small in the sense that 
it is w(A^)-smooth. 

Definition 7.3. An integer is called k-smooth, when each of its prime divisors is at 
most k. 

Proposition 7.4 (Major arc analysis for r'jr^). Let P C [A^/iy] be a progression of 
w{N)-smooth common difference qi and let P & A. If P = {qim + qo : ^ m < M} 
has length M , then 



^<zpr'fp{n) = Eo^rrKMrf^i^iqim + go) = 1 + O^- 



MV2 



Proof. Corollary |6.4| implies 

W Wqi 

Hence the result follows from Lemma [4.11 □ 
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7.2. VT-tricked majorant. Finally, we need to slightly adapt our majorant function 
for rf to its Vl^-tricked version. Let /3 & A. Then Lemma 4.2 and Lemma 5.3 yield the 
pointwise majorisation 

f'^gf d\Wn+l3 

Xd{p) 



0,1) n (i-^)"' E 

p<w{N) d\Wn+l3 

p\d^p>w{N) 



where the last step uses Lemma 6.2 Since each function r'^, p{n) has average order 
1 + o(l), the last expression is of bounded average order. Thus, the function 

^'du){'^) ■= ^^^^ 

d\n 

p\d=>p>w{n) 

may be used in place of ?"d(/) to run through the construction of the majorant as in 
Section [21 In view of the results from that section and the remarks at the end of Section 
[3] we find 

r'D{f)in) ^ /3d,7(^)^,7("') ' 



where 



(log log 7V)^ 6 log log log N 



s^2/7 i>log2S-2 u&U{i,s) 



with 



1 f ^Qg^ ^ 



deiVn) 
p\d=^p>w(N) 



and 



me{QD) ^ deiQo) 
p|m=>p>iu(A'') p\d^p>w{N) 
m<N"' 



For the two factors ,^ and z/^ ,^ one shows in the same way as for the original majorants 
that 

q<^QD 

and 

p<w{N) 

are bounded independently of A^. Since ^ and z/^ ^ are given by short divisor sums 
running over coprime sets of divisors, the average order of their product satisfies 
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Indeed, for coprime integers 2/1,2/2 < N'^, we have 

and since the total number of divisors in the sum E„<jAr/3^ ,y(n)z/^ is N^^^\ the 
statement follows. 

Since np^«,(Ar)(l ~ Xd{p)p^^)^^ = C + o(l) for some constant C, we have proved the 
following lemma. 

Lemma 7.5 (l^-tricked majorant). Let 13 & A, then 

r'fA^) < P'or^Wm + /3)^,^(irm + (5) 
for all m ^ N/W . Furthermore, there is a positive real number Cd,j = 0(1) such that 

E„^7V = 1 + o(l) . 

8. Local factors and the reduction of the main theorem to a 

l^-TRICKED VERSION 

Define the smoothed representation function ff : [N] — i- M by 

fj{n) := rj(n)l„ (^^odW)eA ■ 



According to the definition of A, this function satisfies the conditions of Lemma 3.3 



Thus it suffices to study correlations of functions rj in order to prove the main theorem. 
As the main theorem will show, the asymptotic behaviour of these correlations, 

J2 ff,{Mn))...ffAMn)) , (8.1) 

is determined by the local behaviour of the affine-linear system \1/ modulo small primes. 

By splitting the summation range into progressions of common difference W, we 
reduce the task of estimating (8.1) to an assertion. Proposition 8.1 below, about the 
uniformity of the VT-tricked representation functions. Local factors measuring irregu- 
larities of the system \1/ modulo small primes will appear in this process. 

Define for fixed quadratic forms fi,...,ft and for an affine-linear system \1/ : Z'^ — i- Z* 
the set of residues 

A^ :={a e[W]'^: ^i{a) G Af^ for all i e [t]} 
t t 
={a e WY ■■ \{Ph,^.ia){W) > and HU^) ^ (modp^-^)} . 

i=l i=l 



Notice that any n with non-zero contribution to (8.1) is congruent modulo W to an 
element of this set. For a fixed element a G Aqi let \I' = {ipi, ■ ■ ■ yipt) : Z'^ — )■ Z* be the 
affine-linear system satisfying 

ilji{Wm + a) = Wijji{m) + Ci{a) 

with Ci(a) G [W]. Thus, ipii^a) = Cj(a) (modW), and ipi and ipi only differ in the 
constant term. 

The main result will be deduced from the following Proposition. 



LINEAR CORRELATIONS AMONGST NUMBERS WEIGHTED BY Rf 25 

Proposition 8.1. Let \E' : Z'^ — )■ Z* be a finite complexity system of forms, let a G A<!/, 
and let : Z*^ — )■ Z* be defined as above. Then 

t 

E n^kc.(a)(^.(^)) = + o{{N/wy 

where K' C [Q^N/WY is a convex body such that W^{K') + c(a) C [l,iV]*. 
For every a G define the convex body 

Ka:= {x eR'^ -.Wx + ae K} 



and note that vol(-ft'a) = vol(-ft') /W . Then we can rewrite (8.1) by means of Proposition 
18.11 as follows 

t 

_ vol(ir) + ojN'^) ^ ' P/..^.-(a)(^) 27r 

By the Chinese remainder theorem, the above expectation is in fact a product over local 
densities, that is 



W 

1 = 1 



n ^a6(Z/p"(p)Z)d n " pQ(p) l5/',(a)^0(modp"(p)) > (8-3) 



p<w{N) i=l 



where a{p) = Vp{W). To complete the proof that (8.2) and (8.3) indeed imply the main 
theorem, two further lemmas are required. The first shows that the above factors at 
primes are essentially local factors: 

Lemma 8.2 (Local factors). Let p be a prime. Then 

^Eae(Z/p°(p)Z)d Y\. — 'r,c{p) '^M<i)^0 (modp"(p)) = /^p + O ( log ^^^^ N) 

where 



i=i ^ 



/3p :— lim IEae{z/p™z)d I I 



.=1 
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is the local factor at p. 

The second lemma is an estimate of the local factors. 

Lemma 8.3. Let \1/ = {ipi, . . . ,ipt) : he a system of affine-linear forms for 

which no two forms ipi and ipj are affinely dependent, and all of whose linear coefficients 
are bounded by L. Then 

(3p = 1 + OtMP'^) ■ 

Thus, 



A 



V 



A second consequence of this lemma is that /3p + 0(log ^^^^ N) = j3p{l + 0(log '^'''^ A^)) 
for all p ^ 1. For the remaining p <^ 1, we require an upper bound on f3p. Since Lemma 
^ implies p fi,A{p °'^^^)p~°'^^^ ^ ^ log log for any A G Z/j9°*^p^Z, we may deduce 
n Lemma 8.2 the very crude bound /3p ^ (log log A^)*. Thus, by (8.2), (8.3) and the 



from Lemma 8.2^ 

two lemmas stated above, we obtain 

= iP^ + oiN')) H (/3p + 0(log-^^/^Ar) 

p<w{N) 



p<w{N) 



/3oo n Ap+o(^') 

p<w{N) 

(3ooll^ + oiN'') , 



where we used that w{N) = log log A^. Apart form the proof of the two lemmas, we 
have reduced the task of establishing the main theorem to that of proving Proposition 

EH 

We conclude this section with the proofs of the lemmas, for the purpose of which the 
following notion is introduced. 

Definition 8.4 (Local divisor densities). For a given system \1/ = {ipi, . . . jipt) of 
affine-linear forms, positive integers di,...,dt and their least common multiple m : = 
lcm(c?i, . . . ,dt) define local divisor densities by 

a-^{di, ■ ■ ■ ,dt) := ^neiZ/mZy Y\. ^■4'iin)=0 (modd,) • 
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Proof of Lemma \8.^ We shall show more precisely that f3p satisfies 



(3p =IEag(2/pa(p)z)d Y\ ' lt/'i(a)^0 (modp°=(p)) (8-4) 

1=1 ^ 

t 

+ 0^(a(p))*j ^ IE„G(Z/pAfz)'^ JJ^ lv«(a)=0 (modp'^O • (8-5) 
ai,...,at: j=i 

A'/:=maxi 

Suppose m > a{p). We split the sum ^^^(^i/pm^^d n!=i P/i,i/'i(a)(P™)P~™' ^'^^'^ residues a 
into two parts according to whether 

t 

Y\_ '^^i(a)^0 (modp«(p)) = 1 or . 
1=1 

First note that for any a with '?/'j(a) ^ (modp'), any lift \E'(a + A;/)-'), A; G [p]'^ is 
component-wise divisible to the same powers of p as ^(a). Hence, Corollary 6.4 implies 



1=1 ^ 

^ae(Z/p°(p)Z)d XI (modp-(p)) 



Thus, the terms of the first type give rise to (8.4). Combining part (a), (b) and (c) of 
Lemma 6.3 yields the general bound 

Pfi,Mo-)iP" 



P k=0 



which shows that terms of the second type are bounded by 

t 

'-i/)i(a)=0 (modp'^i) 



t 

ae(Z/pA^Z)'' X J. 



0^ai,...,at^m: i=l 
M:=maxi ai 



This proves the above expression for Pp. In order to establish the lemma, it thus remains 
to bound (8.5), that is, the sum over divisor densities 



Sp:= Yl a^{p^\...,p''^) . 



ai,...,at 
Af:=maxi ai^a(p) 

Since the coefficients of \E' are bounded, we have 



Q;*(p"S • • • — IE„g(2/pmaxiai2-)d JJ^ lv>,(n)=0 (modp°i) ^ P 



i=l 



which yields 



ai,...,at 
maxj ai^a(p) 
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Recall that 

I02: 102: 

aip) = v,iW) = iC + 1) ^ ^ + 0(1) 

for some sufficiently large integer Ci. Estimating the number of tuples (oi, . . . , at) with 
max, tti = j crudely by j*, we conclude that for p ^ w{n) = log log 

j>Ci (log log Af)/2 logp 

« Yl p~'^" 

j>Ci(loglogAf)/2 logp 

< (log iV) -^1/4 . 

Hence, {a{p)y5p <ti (log A^)^*^^/^, which proves the result. □ 



Proof of Lemma 8.3. We may assume that p is large enough so that p \ Di . . . Df. For 



such primes Lemma 6.3 c) yields 



j3p — lim Eaefz/p^z)'* I I 



t 



1=1 jr>0 

= 5^ a(p"\ . . . n (1 - XdMp'')xdM^) . 

ai,...,at je[t] 

By splitting the sum Yla-^ at terms according to whether no is non-zero, exactly 
one Oj is non-zero, or at least two are non-zero, we obtain for Pp the following. 

(3p= j2 "(p"'' • • • 'P"*) n (1 - XD,(p)p-')xD,(p"o 



n (1 - {1 + E E XD.(p"Op-"| + o( E 

je[i] ^ «=1 a!>0 ^ ^ ai,...a 



a(p'^S...p'^*) . 



at: 
at least 
two ai>0 



Here we used the fact that, for sufficiently large p with respect to t,d and L, we have 
a(p"^, . . . = whenever is the only non-zero exponent. 

It is easy to see that the main term equals 1 + Ot(p^^). Concerning the error term, 
we employ the fact that we are dealing with a finite complexity system of forms. That 
is, since no two forms are affinely related, we have for every p which is sufficiently large 
with respect to t, d, L that 

whenever at least two at are non-zero. There are at most choices of coefficients 
ai, . . . ,at that satisfy maxj = j, and thus the contribution of the error term to the 
value of may be bounded by 



o{J2^f-V-')=Otip- 
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This proves the lemma. □ 
Simultaneous majorant. To summarise, we reduced the task of proving the main 



theorem to that of proving Proposition 8.1 This will carried out by the nilpotent 



Hardy-Littlewood method in the remainder of this paper. In order to apply the method, 
specifically Proposition 10.1 below, we require for every occurring collection of {r^ ^ ^^^^ : 



i = l,...,t}, a G Aq, a pseudorandom majorant that simultaneously majorises all 
r'j ^ i^^y The following function has the required majorant property: 

'^{f^),a ■■ [N/W] ^ M+ , (T^f,),a{m) := Ei^[t]^^ • 

(8.6) 

9. Linear forms and correlation conditions 



In this section we check that the majorant o'{fi),a defined in (8.6) for a collection 
of ly-tricked representation functions '"/j ci(a)' • • • ' '"/t ci(a) (after a minor technical 
modification) indeed a pseudorandom measure, that is, satisfies the linear forms and 
correlation conditions. 

Write M = N/W, let M' be a prime satisfying M < M' ^ OtdiiM), and define 
^U),a ■■ [M'] ^ by 

(^if.)A^) - I 1 if M < n ^ M' . 

As is seen in [8^, App.D], o"*j^) ^ is -D-pseudorandom if the following two propositions, 
which are technical reductions of the linear forms and correlation conditions from [H], 
hold true. 

Proposition 9.1 (D-Linear forms estimate). Let 1 ^ d,t ^ D and let (ii, . . . ,it) G [t]* 
be an arbitrary collection of indices. For any finite complexity system \E' : Z*^ — )• Z* with 
bounded coefficients \\^\\n ^ D and every convex body K C [N^ such that "^{K) C 
[N/W]^, the estimate 

^n&^^nK n ^k ,i^Wi,j{n) + &i,)/3^^„^(irV',(n) + h^) (9.1) 



ie[t] 



holds, provided 7 was small enough. 

Proposition 9.2 (Correlation estimate). For every 1 < mo ^ D there exists a function 
(Jmo : Zm' — ^ with bounded moments E^gz^^, cr^^ (n) <^rn,q 1 such that for every 
interval I C Zm', every 1 ^ m ^ mo and every m-tuple (zi, . . . ,im) £ and every 
choice of (not necessarily distinct) hi, . . . , km E I^m' we have 

provided 7 was small enough. 
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Recall that the ly-tricked majorant i^'c,. ^^iWrn + bj{a)) j3'jj^ ^{Wm + bj{a)) for r'j:,^^ has 
divisor sum structure: 

(log log iV)3 6 log log log TV 

E E E E E2-i^i"i"i"^ 

s=2/7 i=log2S-2 u&U{i,s) d&{'PDj) 

{d,uW)=l 



E ^ ( i!?^ ) 1-.^ I E ( ) 1 1 • (9.2) 




imj,W)=l {s,W)=l 

The function x above is a cut-off. As no characters appear in this section, there is no 
danger of confusion. 

Our strategy to prove the linear forms estimate is as follows. The first step is to 
show that in order to asymptotically evaluate (9.1) we may ignore all terms that arise 
from divisor densities of dependent divisibility events, that is, events {n : Y[ie[t] '^ai\i>i{n)} 
where (oi, . . . , at) are not pairwise coprime. The second step is the observation that the 
densities of independent divisibility events are, up to a small error, independent of the 
system \E' of forms, which will finally allow us to reduce the verification of the linear 
forms condition to the task of verifying it separately for each of the two factors of each 
of the majorants in the case where the : Z — )■ Z is the identity function. The same 
strategy was used in [181 §6] 

The main tool to exploit the divisor sum structure of our majorants is the following 
simple lemma (see [8", App.A] for a proof). 

Lemma 9.3 (Volume packing argument). Let K C [—B, B]'^ be a convex body and a 
system of affine- linear forms. Then 

Yl n ^'^^l^dn) = vol(K)«(rfi, ...,dt) + 0(5^-^ lcm(c/i, . . . , dt)) . 

In order to remove the above mentioned dependent divisibility events, we need to 
replace x by a multiplicative function. A way to achieve this has been found by Goldston 
and Yildirim and was employed and modified by Green and Tao [8] to check the linear 
forms condition for their majorant function for H^-tricked primes. In this respect, the 
proof of Proposition 9.1 below builds on [8^, App.D]. In particular, we shall employ many 
of the small technical arguments from there. 

Recall that the cut-off x was chosen to be a smooth, compactly supported function 
satisfying \x'{x)\'^dx = 1. Let be the modified Fourier transform of defined via 



e^xix) = [ me-'^'d^ ■ 
Jm. 

Fourier inversion, compact support and smoothness of and partial integration yield 
the bound 

m<^A (i + iei)-^ 
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for all y4 > 0. Green and Tao make use of this rapid decay to truncate the integral 
representation of x as follows. Let J = G M : |^| ^ log^''^ N'^}, then for any A > 

log m f i+zg 



^log N^' 



m-&d{^) + OAim'^'^"^^" log" ^ N^) . (9.3) 



This truncation will later-on simplify the process of swapping integrals and summations. 
We proceed to check the linear forms estimate. 



Proof of Proposition 9.1, Define the system $ = {'^j)j<^[t\ : Z"' — t- Z* by '^j{n) : = 
Wipjln) + bi-. A prime p is called exceptional for $ if the reduction of $ modulo p 
has affinely depended forms. For the system defined here, all exceptional primes are 
bounded by w{N) + 0{D). All information we will use about $ are the bound on 
exceptional primes and the fact that it has finite complexity. Consider an arbitrary 



cross term that appears on the left hand side of (9.1) when inserting the definition (9.2) 



and fixing the parameters Sj,ij,Uj for each factor. That is, we consider 

' log dj ' 
logiV7 



j€[t] \ dj€{VnA Vj\uj 



j&[t] \ d,&{VD^) vj\uj 
(dj,UjW)=l 



/logmA / ^ f ^ogcj 



imj,W)=l 




log A^'^', 



E f n 2^^Me.)M4)r(..) n ^(i^))^--^ni.ni 

d,m,e,e' \je[t] r. ^'^ /i ^ 

{dj,mj,ej,e'j} 



Uidimfei\ipi{n) 



where = lcm(ej, and where we denote by bold letters such as d any t-tuple of 
positive If (A^)-smooth integers which we shall implicitly assume to satisfy the correct 
multiplicative restrictions, e.g. di e (Pj) and {di,ViW) = 1 in this case. 

Note that u^djin^Ej = N'^^"''> for all summands with non-zero contribution. Indeed, 
dj,ej,e'j,'mj ^ by definition of the cut-off. We have uj < N'^ by construction of 
the divisor majorant, as the Uj arise as divisors of certain numbers < N"^ (c.f. also 
the remarks following Proposition 4.2 of [H]). Therefore, the volume packing lemma 
implies 

= a^iuidimlsi, . . . , utdtm^^et) JJ 2''t{u,) + O ^Af-^-^+^^^V ^K^)) , 

where the bound 2"^ ^ 2(i°g'°s^)' < allowed to hide the factors 2^^ in the error 
term. 

Since UjdjnVjEj = N'^^'^\ there are only N'^^"'^ terms all together in all sums of the 
majorant, including those over sj, ij and Uj. This and the boundedness of x imply 
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that the volume packing error term has a total contribution of 0(M'^~^+'^'^'^y vol(-ft')) 
towards ( |9.1[ ), and we are left to deal with the main term, that is 

logx 



log A^'' 



Next we show that we may assume that each Ui is coprime to Ujdjin'jej for all j ^ i and 
that {ui.dimfsi) = 1. These properties yield 

a$(Mi(iim^£i, . . . , Utdtrnfet) = a^^dimlei, . . . , dtin^et) . 

Ml . . . 

We shall also abbreviate u = {ui, . . . ,Ut), implicitly assuming that the conditions Uj G 
U{ij,Sj) on these tuples still apply. 

Claim 2. For all choices of {sj)j^[t] and {'ij)je[t] we have 

logx 



Y(^'S>iiuidim^iei),^[t])Yl2'^fi{ej)fi{e'j)T{uj) JJ x 

d,m,e,e' u je[t] a;e 

{dj,mj,ej,e.} 



log N-y 



d,m,e,e' ui,...,ut je[t] ^ ^ ^ ^ 

{dj,mj,ej,e.} 



4. 



where indicates that the sum is extended only over choices {ui, . . . ,Ut) satisfying 
the coprimality conditions {ui,Ujdjm'jej) = 1 whenever i ^ j and {ui^dimlsi) = 1 for 

te[t]. 

Proof. We have to bound the contribution from excluded choices of {ui, . . . ,ut). Any 
prime divisor of any Ui is at least as large as A^^/(i°g'°g^) by construction. Thus, 
whenever the coprimality conditions fail, the divisibility events we are considering are 
included in {n : Y[ie[t] '/'«(^)} some p > A^i/('°si°s^) . By finite complexity and 
the bounds on exceptional primes of we have 

^(log log ]V)-3 ^p^JVf N0-°S log AT) -3 <p<N^ 

We will make use of this with the help of Cauchy-Schwarz. Since 

since is at most 1, we can crudely bound the following second moment 

(\ 2 
d,m,u,e,e' 2:6 V 6 / 

{dj,mj,ej,e-} 



ieftl V \ d,m,u,e,e' / / 



m 

< (logAr)«W22*(i°si°g^)' . 
The combination of these two bounds proves the claim. □ 



LINEAR CORRELATIONS AMONGST NUMBERS WEIGHTED BY Rf 33 

Note that the same argument furthermore shows that the main term from Claim |2] 
equals 

d,m,e,e' je[t] u, ^ 

{dj,mj,ej,e'j\ 

+ Ob(A^~^'°'^^°^^^"*) . (9.4) 



Thus, we are left to deal with the main term in (9.4). We proceed by inserting the 
integral representation (9.3) of each of the At factors involving x- Multiplying out this 
product we obtain a main term and number error terms. Since xii^^^) ^ m~^/'°^^^, 
all these error terms may be seen to be of the same form, which allows us to combine 
them into one error term. Writing Zj^k = (1 + ^^i,A:)/ logiV''' for j G [t],k G [4] and noting 
that \zj^k\ ^ (log A^'^)"^/^, the main term from (9.4) is seen to equal 



d,m,e,e' ^ i€[t] Ui * ^ 

j^- j^W /^(e,)/i(e;)e-^-^e;-^-^rf;^^'^mT^^'^ J] ^(4,) d^. 



je[t] fce[4] 



+OA[\og-^N^\{{e,e'^d 



,m,)-^/^--^ 



The error term here indeed has small contribution: On the one hand, we have 

E En E ^ = 0(1). 

Si,...,stii,...,it j£[t] ^ 
UjGU{ij,Sj) 

See the proof of [IHl Proposition 4.2] for details. On the other hand, the divisor sum is 
bounded: 

d,m,e,e' 

= E n «*(p'^^•••,p"o n {p"^^"'^)-" 

d,m,e,e' p>w{N) j&[t] 
p'^i lld. rrt^e,- a'- , , 

« JJ (l+p-(i+Viog^-))-o(t)<^lQgOW^^ 

p>w{N) 

Here, we crudely bounded the number of occurring t-tuples (ai,...,aj) that satisfy 
maxj = A; by k^^'^^ and apply to each of these tuples the bound aq,{p°'\ . . . , p"*) <^ p~^. 

Thus, when choosing A in (9.5) sufficiently large, the error term above makes a total 
contribution of <^a log^'^''^ A^'''. 

It remains to estimate the main term from above. Changing the order of summation 
and integration leads to an absolutely convergent sum in the integrand. Since the range 
of integration is compact this change is permitted and, hence, the main term is equal 
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to 



[■■■[( E fnE^^)«*(K^^^^Wi^ (9-6) 

\d,m,e,e' ^ ie[t] Ui ^ 

Our next aim is to show that all relevant terms in the integrand are in fact the inde- 
pendent terms, that is, they are those terms for which the t products Uidim^Ci, i G [t] 
are pairwise coprime. This will eventually allow us to swap the sums with the product 
while only introducing a small error. For the Ui we have just done this. 

Since each entry of d, m, e, and e' is completely composed of primes ^ w{N), the 
following claim holds. 

Claim 3. We have 

d,m,e,e' je[t] 

d,m,e,e' je[t] 

where indicates that the summation is extended only over choices of t-tuples that 
satisfy the coprimality condition {dirriiei, diimiiSii) = 1 for any i ^ i'. 

Proof. Note that the summand is multiplicative and may be written as a product over 
primes p > w{N). Any summand 

a<^{{dim'^iei)ie[t]) Y[ n{ej)n{e'j)ej''-'e'j~'''^dj'''''mj'''^ 

with entries failing coprimality may be factorised into a product of one factor of the 
same form that satisfies coprimality and one factor for which every prime p that appears 
as a divisor of some dimiSi divides at least another di'mi'Ei', i' i. For a fixed tuple 
{ki, . . . ,kt) of the latter type (that is, p\ki implies p\ Yii'^^i contribution may be 

bounded as follows employing the triangle inequality: 



a{ki, ...,kt] 



3, 3, 



d,m,e,e' j6[t] 
(dimiei, ki...kt)=l 



= a{ki, ...,kt) 

p\ki...kt d,m,e,e' je[t] 

^ a{ki,...,kt) 



p\ki...kt 



Zj,3 —Zjd 

J 
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Next, we bound the sum over all occurring terms a{ki, . . . ,kt) Y[p\ki kS^ + ^(^' ^))- 
Written as a product over primes, a crude bound for this quantity is given by 

n {l+ E 0{at + --- + at)a^{p^\...,p^^){l + Oip-'))]-l, 



p>w(N) 



cti,...,at: 

at least two 
ai>0 



where we used the very crude bound r^ip""^) ^ af on the generalised divisor function t^. 
The five factors correspond to di, m^, 5j/ej, ei/e[ and eie'Jei. To further bound the above 
expression, we observe that the number of tuples (ai, . . . , at) with maxj ai = k is at most 
tk^~^. For such choices of (ai, . . . , at), we have af ^ tk'^ and Q;$(p"\ . . . ,p°'*) ^ p~^~^, 
since $ has finite complexity and at least two of the are non-zero. Further, for large 
enough p, we have p~^t^k*~^{l + 0{p'^)) < p~'^^/^ for all k ^ 1. We certainly may 
assume that is large enough for p > w{N) to satisfy this condition. Thus 



J2 0(af + --- + a^)a<i,(p'^i, 



at 



ai,...,at: 
at least two 
ai>0 



As 



1^ J2 r2-3/2 < ^(Ar)-V2 

n>w(N) 



n i^+p-'^': 

p>w{N) 

the result follows. 

Note that in the above claim 

^ a^{{dim^i€i)i(.[t]) JJ /i(ej)/i(e^)ej'''''e^"^'''rf 



□ 



Zj,3 ~^iA 

3 



d,m,e,e' 



(9.7) 



d,m,e,e' je[t] 

holds. The last step of the rearrangement is to show that we may swap the inner product 
and sum in the integrand. 



Claim 4. The sum and product in (9.7) may be interchanged: 



E n 

d,m,e,e' j£[t] 



^i{ej)^i{d: 



"i '"'j 



;l + 0(^.(iV)-V^))^ E ^^^^^e/-e 



dj ,mj ,ej ,e'j 

Proof. The proof of this claim is essentially the same as the one of the previous claim. □ 



The next claim will imply that the integral (9.6) equals, up to a small error, the 
integral of the main term from Claim |4} 

Claim 5. 



n E 

j&[t] dj ,mj ,ej ,e'^ 

0(1) . 



/i(ej)/i(e;.; 



fee [4] 



e[t]x[4] 
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Proof. (Cf. [8], equation (D.23) and the proof thereof.) We begin by writing the integ- 
rand as a product over primes 



-|-|- ^ Kej)f^{e'j 



'3 '^j fij 



jeM d. 

im q&Qj 



n ^(o.'^^ 

fee [4] 



-1-2 



J,2 



P<^Vj /£e[4] 



By the prime number theorem in arithmetic progressions, we have for > 



p6Si 



^log^ + Oz..(l) . 



This also holds for Vi in place of Qi. Thus, choosing A sufficiently large the above is 
seen to be bounded by 



ie[t]A:e[4] geQj pdVj 

« iog*iv^iog-*iv^ + + n (1 + i^^'-'^i)"'" 

ie[t] fce[4] 

For any A > 2 the integral of the final expression is 0(1). 



□ 



Together with Claim [sj equation (9.7) and Claim |4| the above Claim implies that the 
integral (9.6) is given by 



n E 



1 J I ^ ^ E j 



fce[4] 



e[t]x[4] 



Ui 



+ 0(1). 



Removing the truncation of the integral again, the latter expression is seen to equal: 

logx 



dj ,mj ,ej ,e • 



{ej,e'.,mj,dj} 



log A^-^ 



) + o{l). 
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Putting everything together, we have shown that 

= il + OdiwiN)-'/')) 

n(^^^^ 2^^r(Mi)/i(ej)/i(e;.) log a; 

{ej,e^,mj,dj} 

The last expression now is independent of $. Applying the asymptotic in each of the 
known one-dimensional cases 

where $ : Z — )■ Z is given by the identity, implies that each of the factors above is of 
the correct form. This completes proof of the Proposition. 

Proof of Proposition |9.2[ The proof of the correlation estimate follows in a very 
similar manner to those of the corresponding estimates for the divisor function major- 
ant in [TBI §7] and the von Mangoldt function majorant from [S] App.D]. We restrict 
attention to the case of pairwise distinct hf, the remaining case follows, as before, by 
choosing (Jmo(O) sufficiently large. Employing the volume packing lemma, we may show 
as in [IHl §7], that 

je[m] p|A ai,...,am 

p>w{N) 

where A := Yljjtfi^i^j ~ ^j') + ~ \/)- This estimate allows us to proceed as in 

m §7]. 

10. Application of the transference principle 

This section provides a quick overview of the results around the von Neumann theorem 
and the inverse theorem for the Gowers norms. We apply these results in the end of the 
section to reduce Proposition to a non-correlation estimate. 

In the dense setting, that is, if : Z — t- M is a bounded function with asymptotic 
density, the Gowers uniformity norms, defined as 

1/2" 

kllf/^fAf] '■= \'^xe[N]^he[NY 

capture all information on the correlations of g with respect to finite complexity systems. 
This generalises as follows. 



Yl g{x + u-h)] 

G|0,1}= / 



Proposition 10.1 (Green- Tao [8j, generalised von Neumann theorem). Let t,d,L be 
positive integer parameters. Then there are constants Ci and D, depending on t, d and 
L, such that the following is true. Let C , Ci ^ C ^ Ot^xi^), be arbitrary and suppose 
that N' G [CA^, 2CA^] is a prime. Let v : Z^' — > be a D -pseudorandom measure, 
and suppose that fi, . . . , ft : [A^] — t- M are functions with \fi{x)\ ^ ui^x) for all i e [t] 
and X e [A^]. Suppose that \l/ = {ipi, . . . ,ipt) is an affine-linear system of forms whose 
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linear coefficients are bounded by L. Let K C [— A^, A^]'^ be a convex body such that 
'^{K) C [A^]*. Suppose also that 

min ||/,||^*-i[^] = o(l) . (10.1) 

Then we have 



Establishing the Gowers-uniformity condition (10.1) itself is a task that is concep- 
tually equivalent to that of finding an asymptotic for IliGit] /('^«('^)) directly, 
and should therefore not be any easier. The specific system of affine-linear forms that 
appears in the definition of the uniformity norms, however, allows an alternative char- 
acterisation of Gowers-uniform functions. 

A characterisation of Gowers-uniform functions. Whether or not a function / 
is Gowers-uniform, is characterised by the non-existence or existence of a polynomial 
nilsequenc^ that correlates with /. On the one hand, correlation with a nilsequence 
obstructs uniformity: 

Proposition 10.2 (Green- Tao [8], Cor. 11.6). Let s ^ 1 be an integer and let 5 G (0, 1) 

be real. Let G/T = (G/F, dc/v) be an s-step nilmanifold with some fixed smooth metric 
dc/T , and let {F{g{n)V))n&i be a bounded s-step nilsequence with Lipschitz constant at 
most L. Let f : [A^] M. be a function that is bounded in the Li-norm, that is, assume 
= E„g[Ar]|/(n)| ^ 1. If furthermore 

E„e[iv]/(^)i^(^7(ri)r) ^ 6 

then we have 



An inverse result to this statement has been known as Inverse Conjecture for the 
Cowers norms for some time and has recently been resolved, see [11]. The inverse 
conjectures are stated for bounded functions. With our application to the normalised 
divisor function in mind, we only recall the transferred statement, c.f. [HI Prop. 10.1], 
here. 

Proposition 10.3 (Green- Tao-Ziegler, Relative inverse theorem for the Cowers norms). 
For any < 5 ^ 1 and any C ^ 20, there exists a finite collection A4s,s,c of s- 
step nilmanifolds G/T , each equipped with a metric dc/v, such that the following holds. 
Given any N ^ 1, suppose that N' G [GN,2GN] is prime, that v : [N'] — )■ IR+ is an 
{s + 2)2'^^^ -pseudorandom measure, suppose that f : [N] — )■ M zs any arithmetic function 
with \f{n)\ ^ z/(n) for all n G [n] and such that 



s+i[N] ^ 5 . 

Then there is a nilmanifold G/T G Ms^s,c ^he collection and a 1-bounded s-step 
nilsequence {F{g{n)T))n(zfq on it that has Lipschitz constant Os,s,c{^), such that we have 
the correlation estimate 

|E„e[;v]/(^)i^(^7(^)r)|>.,5,cl. 



^For definitions of nilmanifolds and nilsequences, see, for instance, [lOj . 
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This inverse theorem now reduces the required uniformity- norm estimate (10.1) to 
the potentially easier task of proving that the centralised version of / does not correlate 
with polynomial nilsequences. 

10.1. Reduction of the main theorem to a non-correlation estimate. We already 



reduced the main theorem to the W^-tricked version given in Proposition |8.1[ which we 
now restate: 



Proposition 8.1 Let \E' : Z'^ — )■ Z* be a finite complexity system of forms, let a G A^, 
and let ^-.1/^1} he the translate of \E' defined as in Section Then 

t 

E„^ezdni^' JJr}^,,^(„)(V^(m)) = 1 + Ot,d,L{l) , 

i=l 

where K' C [N /Wf is a convex body such that W^{K') + c(a) C [1, A^]*. 
Writing 

t t 

i=l 1=1 

and multiplying out, we obtain a constant term 1 and all other terms are of a form the 
generalised von Neumann theorem applies to, provided we can show that 

\\r'f,M<^) - M\u^-^ =o{l) 

for all i G [t]. By the inverse theorem, it suffices to show that 

\Ke[N/W]ir'f^Ma){n) - l)F{g{n)T) \ = OG/r,t(l) 

for all {t — 2)-step nilsequences {g{n)T)^^j^^^ and 1-bounded Lipschitz functions F. 
This task will be carried out in the sections flil - fTSl 

11. NON-CORRELATION WITH NILSEQUENCES 

The so far standard line of attack to obtain a result of the form 'the function h does 
not correlate with fc-step nilsequences' is to employ the Green- Tao factorisation theorem 
Pm 1.19], which allows us to reduce this task to the case where the nilsequence is close 
to being equidistributed. A separate estimate which shows that h does not correlate 
with periodic (nil) sequences allows us to further assume that the Lipschitz function 
involved has zero mean, that is, /^/p-F = 0. Periodic sequences are regarded as major 



arcs. We have already deduced a major arc estimate in Section The remaining 
case with the strong assumption that the nilsequence behaves in a very equidistributed 
way corresponds to the minor arc analysis of the classical Hardy-Littlewood method, 
cf. the discussion in [HI §4] . The procedure of passing to the equidistributed (minor arc) 
case is fairly independent of the individual problem and is completely described in §2 
of [9]. Thus, we restrict our attention here to provide the necessary major and minor 
arc estimates specific to our problem and only summarise the procedures from [9] we 
employ. 

Our approach to the minor arc estimate is modelled on a strategy one might choose 
in the classical setting: If ^ is a rational that belongs to a suitably chosen notion of 
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'minor arc', then one obtains an upper bound for the expression 

En^Nrf{n)e{9n) = ^ e{9f{x, 1/)) = ^ Yl ^(^(aa;' + bxy + cy^)) 

x,y : x,y : 

fix,yHN fix,yHN 

by sphtting into suitable summation ranges, fixing either x or y, and applying Weyl's 
inequalitjj^ Thus, in our case, we aim to employ the quadratic structure of the form / 
by means of Weyl's inequality in order to deduce the estimate 

Er,^Nrf{n)F{g{n)r) = o(l) 

for sufficiently equidistributed sequences {g{n)T)n^j^. When working with a sequence 
(F(5f(n)r))„g[7v] directly, Weyl's differencing trick may only be employed locally on so 
called generalised Bohr neighbourhoods, where one can make the locally polynomial 
structure of a nilsequence explicit, cf. the approach in [7]. 

The crucial fact that makes Weyl's differencing trick work for exponential sums is the 
fact that the exponential function is a group homomorphism. Since F is a Lipschitz 
function, one expects it to have a good, i.e. short, Fourier approximation. In general, 
elements of a Fourier basis in the non-abelian case arise from characters, i.e. homo- 
morphisms. Thus there is a good chance that it is possible to employ Weyl's inequality 
globally for elements of the Fourier basis and hence for a short Fourier approximation 
of a Lipschitz function. 

In our case, the situation is considerably simplified by the availability of a complete 
quantitative equidistribution theory for polynomial orbits on nilmanifolds, which has 
been worked out by Green and Tao in [10]. In particular, their generalisation of Leon 
Green's theorem ('Quantitative Leibman theorem' [I0| Thm.1.16]) asserts that any poly- 
nomial sequences on a nilmanifold G/T is 5-equidistributec|^ and only if its projection 
on the horizontal torus is ^'-equidistributed, where the dependence is polynomial. The 
horizontal torus bears the advantage of being isomorphic to an ordinary torus M'^^^'^/Z'^'''^. 
Consequently, we need not consider the representation theory on nilpotent Lie groups 
and their homogeneous spaces; analysing the projected sequence on the horizontal torus 
by standard Fourier analysis, or even the quantitative version of Weyl's equidistribu- 
tion theory, is sufficient. (The latter theory will actually reduce matters to looking at 
sequences Z — )■ M/Z arising from horizontal characters.) 

Our strategy, after reducing to the equidistributed case, is the following: Let P denote 
a polynomial of degree d. Then equidistribution of {g{n)T)n<N on G/T implies that 
{7iog(^n))n<N is equidistributed on the horizontal torus, which implies, as a consequence 
of Weyl's equidistribution theory, that (tt o g[P{n)))^^^i/d is equidistributed on the 
horizontal torus, which implies that (5'(P(n))r)„<^i/d is equidistributed on G /V. The 
distribution of polynomial subsequences was not considered in [T^ , but will follow from 
results of that paper. These results will be proved in sections [14] and [15] 

For the above strategy to work, a strong major arc analysis is required, because the 
VT-trick introduces very large coefficients into the quadratic forms under consideration. 
For the major arc analysis, we rely on the observation that all of these large coefficients 
turn out to be entirely composed of small prime factors. We briefly describe in the next 
section how this information is used to choose major and minor arcs in the classical 



setting. The general case will be carried out in Section 15 (especially Corollary 15.2 and 



"'See the next Section for more details. 

^The quantitative notion of equidistribution is recalled in Section 14 
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Proposition |15.4 which deal with polynomial subsequences that have large but smooth 



coefficients) and Section 16 , which provides a factorisation of polynomial sequences into 
major and minor arcs. 

12. A SPECIAL CHOICE OF MAJOR AND MINOR ARCS IS NECESSARY 

In this section we describe briefly and solely for motivational purposes how the major 
and minor arcs are chosen in the model case of correlation with linear phase functions 
e{9n) instead of general nilsequences. Here the task is to show that 

^n^Nir'f^fsin) - l)e{0n) = o(l) . 



In Section 7.1, we saw that (r^^ — 1) does not correlate with any g-periodic function 
of w(A^)-smooth period q, provided N/q is still quite large. It is therefore possible to 
choose the major arcs to consist of all rationals 6 G [0, 1) that are close to a rational 
with w(A^)-smooth denominator: in that case e{n6) is close to a periodic function with 
w(A^)-smooth period. The minor arcs then comprise all 6 that are not close to rationals 
with w(A^)-smooth denominators. For such a 'minor arc' 6, we automatically have 

Thus, we deflne the major arcs to be 

9Jt := U 9Jt, , 

where H is the following set of all not too large w(A^)-smooth denominators 

£1:={1 ^q^ N' -.plq =^ w{N)} 

and where is the set of real numbers that are well approximated by some rational 
with iy(A^)-smooth denominator: 

^ — -r; — for some (a,q) = l\ . 
qN^-^ ^ ^ i 

The reason behind this choice of major arc is the following. When we pass to W- 
tricked versions of the representation function, which are up to normalisation of the 
form n t— )■ rf{Wn + then this restriction to a linear substructure cannot directly be 
expressed by the quadratic form /. For the minor arcs treatment, we, however, hope to 
work with the quadratic form directly. We will therefore consider all choices {x',y') G 
[W]"^ such that f{x',y') = (3 (modW^) and consider for each choice the quadratic form 
f{Wx + x', Wy + y') in x, y. Fixing either x or y, we hope to apply Weyl's inequality 
when 6' ^ 2Jt to estimate 



{.: 









J2 r{Wn + /3)e{9n) 

ns^(N-l3)/W 



x',y'&[W]: ,— -^jI 



d{f{Wx + x', Wy + y') - (5) 

w 



Here we obtain for flxed x\ y' and either flxed x or flxed y a quadratic inside the 
exponential with leading coefficient QWa or QWc where a and c are coefficients of /. 
For the application of Weyl's inequality, we require that this leading coefficient is close 
to a rational with large denominator. 
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Since acW <^ A^°(^\ the choice of major and minor arcs guarantees that, when 9 971, 
i.e. 



1 



qN 



l-e 



for some q that has a prime factor > w{N), or satisfies q > N^, then 

a' 



aW9 

q' 



1 



where q' has a prime factor > w{N), or satisfies q' > N'^~°^^\ Thus, aW9 can still be 
thought of as minor arc, when replacing by N^~°^^\ 

13. A BRIEF OVERVIEW OF THE CONCEPTS AROUND NILSEQUENCES 

Let G be a connected, simply connected, fc-step nilpotent Lie group, and let F be a 
discrete co-compact subgroup. Then G/T is called a fc-step nilmanifold. A filtration G, 
of G is a sequence of subgroups 

G = Go = Gi ^ G2 ^ . . . ^ ^ G,+i = {idc} 

such that for any d ^ i, j ^ the commutator group [Gi,Gj] is a subgroup of Gj+j. 
The filtration is said to have degree d, if Gd+i is the first element in the sequence that 
is trivial. By definition, a nilpotent group always has a filtration. 

The quantitative analysis carried out in |10] relies on the existence of a certain type 
of basis, a Mal'cev basis, for the Lie algebra q of G. Adapted to any filtration, there 
exists a Mal'cev basis for g that parametrises via the exponential map both the groups 
in the filtration and the uniform subgroup F in a very natural way. For each such basis 
X, Green and Tao introduce a metric dx for G and its quotient G/F in p]0| Def.2.2], 
which then allows them to define Lipschitz functions on G/F, and also allows them to 
introduce a notion of slowly varying (or smooth) sequences {e{n))n£z that take values 
in G. Despite the fact that any of the statements on nilsequences require a fixed choice 
of Mal'cev basis A" and corresponding metric rf^', we will not need to directly work with 
any of the specific properties of either of these objects: they will only implicitly be 
present through the results from [TU] we build on. For this reason, we content ourselves 
to refer to [10_, §2 and App.A] for background and exact definitions. 

Definition 13.1 (Polynomial sequence; Def.1.8 ^Q\). Let g : Z ^ G be a G-valued 
sequence, and define the discrete derivative dhg{n) := g{n + h)g{n)^^ for each /i G Z. 
The g is a polynomial sequence with coefficients in G,, when for every i G {0, . . . , d+1}, 
and every choice of hi, . . . , hi G Z all i-th derivatives satisfy dh^ . . . dh^g{n) G Gj. We 
write poly(Z, G,) for all polynomial sequences adapted to G, and say they are of degree 
d, where d is the degree of the filtration. 

Two facts about polynomial sequences are of particular importance. The first is a 
theorem of hazard: poly(Z, G,) forms a group; see [ini §6] for a proof and the reference 
to the original work. The second important property is a more explicit description of 
polynomial sequences. It is shown in [10] (cf. §6 and the remarks following Def.1.8) that 
every polynomial sequence can be written in the form g{n) = a^^^"''' . . . a^* where k is 
some integer, ai, . . . , G G, and pi, . . . : Z — )■ Z are polynomials. Observe that, if 
the sequence gi defined by gi{n) = '•"^ belongs to poly(Z, G,), then the assertion that 
the discrete derivatives of order d + 1 all equal id^ directly translates to deg(pj) ^ d. In 
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general the degree of the polynomial sequence g is much larger than the degrees of the 
polynomial exponents pi, . . . ,pk that appear in the above mentioned representation. 

Definition 13.2 (Horizontal torus). Write n : G ^ (G'/r)ab := G/{[G,G]T) for the 
canonical projection of G on the abelianisation ofG/T. (G/r)ab is called the horizontal 
torus ofG. 

We will extensively work with horizontal characters : G — t- M/Z. These are additive 
homomorphisms that annihilate F. Note that when g has degree d, that is, when g has 
coefficients in a filtration of degree d, then the projection r] o g can be written as an 
ordinary polynomial of degree at most d taking values in M/Z. 

[Tin Def.2.6] defines the notion of the modulus \ri\ of a horizontal character. All that 
is important to us, is that H^^IIlip ^ I'?!- 

14. Reduction from nilmanifolds to the abelian setting 

In this section we provide the tool for passing from a general nilmanifold to the abelian 
setting of the horizontal torus. We caution, however, that by far the largest amount of 
the real work behind these results is hidden in the application of fiUi Thm 1.16], while 
the converse statements we prove are fairly straightforward. 

Integral to all what follows are the two quantitative notions of equidistribution that 
were introduced in [TDl Def. 1.2]: 

Definition 14.1 (Quantitative equidistribution, [10]). Let G/T be a nilmanifold en- 
dowed with Haar measure and let 61,62 G (0,1) be parameters. A finite sequence 
{g{n)r)n^]S[ is said to be 5i-equidistributed if 

E^r.elN]F{g{n)T) - [ F 
Jg/t 

for all Lipschitz functions F : G/T — )• C with 

\F{x)-F{y)\ 
x,yGG/r,xf^y dc/vix^y) 
{g{n)V)n^jq is said to be totally 52-equidistributed if 



lup := lli^lloo + sup 



^ ^ill^l 



Lip 



E„gpF(f7(n)r) - / F ^ 52||F||Lip 
Jg/v 

for all Lipschitz functions F as above and all arithmetic progressions P C [A^] of length 
\P\ ^ 62N. 

For polynomial sequences these two notions of equidistribution are equivalent in the 
sense that every totally (52-equidistributed sequence is 52-equidistributed, and every 61- 
equidistributed sequence is totally 52(^1 )-equidistributed, where 6f ^ 62(61) ^ 61 for 
some A only depending on the degree of the sequence, and the dimension and step of 
the nilmanifold. (As this observation will not be used later on, a proof is omitted.) 

We set out by recalling the quantitative version of Weyl's inequality from [lOJ, and 
the notion of smoothness norms in terms of which this inequality is phrased. 

Any polynomial (7 : Z — ?■ M/Z of degree ^ d has an expansion of the form 

/ fi\ / ji 

g{n) =ao + ttiLj H ^"^(^ 
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The smoothness norm of g is defined by 

||fi'||c"«[Ar] := sup N^\\aj\\M/z ■ 

This norm was introduced in |TU1 Def. 2.7] as a measure of slow variation of polynomial 
sequences on tori. Indeed, 

\\g{n) - g{n - 1)||k/z <d \\9\\c--[n]/N (14.1) 

holds. For us it will be more convenient to work with the coefficients of the ordinary 
representation of g. When g{n) = f3dn'^ + f3ci-in'^~^ + ■ — \- f3o, then (cf. |9l Lemma 3.2]) 
there is g ^ 1 with q = 0^(1) such that 

Ik/Silk/z <iV"i^||c-[7V] (14.2) 

for j = 1, . . . ,d. This follows by expressing each Pj as a linear combination of ctj. The 
coefficients appearing are bounded by 0^(1). 
In the other direction we can show 

Ikllc-ITV] < sup N^\\j\(3j\\u/z . (14.3) 

Indeed, jiPj is a linear combination of a^, i ^ j, where the coefficient of aj is 1 and 
all other coefficients are 0^(1). Let jo be the maximal index for which ||5'||c°°[Af] = 
A^^o||ajJ|. Then N'\\ai\\ < N^°\\ajJ for all i > jo- Thus < N^^-'WajJ. Then 
llio'/^joll = llc'^jolKl + Od{N~^)), which proves the result. 

Part (a) of the following is Green and Tao's Proposition 4.3 from (TU]. While the latter 
is quite a deep result, its converse, which we prove as part (b), is rather straightforward. 

Proposition 14.2 (Weyl). (a) Suppose that g : Z ^ is a polynomial of degree d, 
and let < 6 < 1/2. // {g{n) (mod Z))„g[iv] is not 6-equidistributed in M/Z, then there 
is an integer k, 1 ^ k <^ §-Od{i) ^^^/^ ^/^^^ ||%||c°°[iv] ^ S''^'^^^^ 

(6) Suppose that the parameter 6 = S{N) G (0, 1) satisfies 5^* <^t N for all t E N. 
Further, suppose there are positive integers ki, . . . ,kd satisfying kj ^ 6^'^'' ^ such that 

\\k,a,\k/^^6'^"'/N^ . 

Then, provided N is large enough, there is some positive integer A = 0^(1) such that 
{g{n) (mod Z))„g[Ar] is not totally 6^-equidistributed in M/Z. 

Remarks. (1) The precise choice of exponents in the bounds ^ is not important to 
this result, but we will later make use of the fact that this way k^kd-i ■ ■ ■ kd-j ^ 5^^^ 

(2) In part (6), the conditions ||A;jaj||R/2 ^ 5'"^"^ ^ /N^ can be replaced by conditions 
of the form \\kjl3,\U/^ ^ 5-^'-' /N^ as they imply || A;,j!/3, ||m/z < 5-^''' /NK 

Proof. All that is left is to prove part {h). Put k = \cm{ki, . . . , kd). Then, by the 
assumption on 6, 

\\kaj\\^/z^S-^'/N^ = o{l) 
for some A' = 0^(1) for each j G [d]. Consider the sequence 
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By (14.1), each g{kn) in the range satisfies '^g{k) — g{kn)'^ <ti 5^' . Thus e o g 



exp{2TTig(-)) is almost constant on this range and we obtain for sufficiently large 



E^^^N^2A'^e{g{kn)) 



e{x) dx 



> 1 



I Lip ' 



□ 



that is, {gin) (modZ))„^7v is not totally jk = ^'^'^'^-'^^-equidistributed. 

The equidistribution of nilsequences is related to the equidistribution of certain poly- 
nomial sequences via the following projection theorem. 

Proposition 14.3 (Green- Tao 'Quantitative Leibman theorem'). Let m,d,N be posit- 
ive integers, and let 6 & (0,1/2) be a parameter. Let G/T be an m- dimensional nilman- 
ifold together with a filtration G, of degree d and a -rational Mal'cev basis adapted 
to this filtration. Suppose that g G poly(Z, G,). Then there are positive constants B 
and B' , only depending on m and d, such that the following holds. If {g{n)T)n<^N is 
not totally 6-equidistributed in G/T , then there is a non-trivial horizontal character rj 
of modulus \r]\ <^ 5-Om,d(i) such that [r] o g{n))n^]sf is not totally 5^ -equidistributed in 



Conversely, if there is a non-trivial horizontal character rj of modulus \ri\ ^ 6^^ such 
that (rj o g{n))n^N fails to be totally 6-equidistributed in M./Z, then {g{n)T)n^N is not 
totally 6^ -equidistributed in G/T. 

Proof. If {g{n)T)n^i\i is not totally (5-equidistributed, then there is a progression P = 
{po,Po + 1, ■ ■ ■ ,Po + ^q} of length at least 6N such that the sequence {g{n)T)n£p fails 
to be (5-equidistributed. Define g' G poly(Z, G.) by g'{n) := g{qn + po). Then [TOl Thm 
2.9] implies that there is a non-trivial horizontal character t] : G — »■ R /Z of modulus 
\r]\ ^ S-Orn,dW such that |ho5''||c°°[<5Ar] ^ By Proposition 14.2 'b), this implies 

that rj o g fails to be totally 5'^-equidistributed for some B = Om,d{l)- 

In the other direction, if there is a non-trivial horizontal rj of modulus bounded by 
6^^ such that {rj o 5((n))„^Ar fails to be totally 5-equidistributed, then we again find a 
progression P = {po,Pq + q, ■ ■ ■ ,Po + ^q} of length at least 6N such that the sequence 
(?7 o g{n)T)r,^p fails to be 5-equidistributed. By Proposition 14.2[ a) we have 

\\v°9iPo+J(l)-V°9iPo + U - l)q)\\R/z < 5-^™-^(^V^ 



(14.4) 

\. Since t] is an additive character on a compact group, we have 
0. Consider the subprogression P' = {po, po + <?,•••, + ^'l} C P, 



for all j G {1, . . . , 

where i' = 6^ N, with B' = Om,d{l) large enough so that (14.4) guarantees 



\V o 9{Po) - V o 9{Po + jq)\\m ^ jr^'.d) ^ ^ 



for all j, ^ j ^ This implies 

EneP'eii] o g{n)) - 



IE, 



neP 



e{r]og{n))\ > 



1 



e{ri{x)) dx 

I G/T ^ 

using the fact that 3fJ(e(a:)) = cos(27rx) ^ 1 — (27rx)^ > 1 for x ^ ^. 

Since ||e o ?^||Lip(G/r) < ||e||Lip(iR/z) ||'7||Lip(G/r) < 5-C'-><i(i), where the bound on the 
Lipschitz constant of rj comes from the bound on the modulus (cf. [101 Def.2.6]) of the 
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character, we may in fact choose B' = Om,d(l) large enough to ensure that also 

fS'ii II 

-> d lleor^llLip 

holds. Thus, {g{n))n<N is not totally 5^'-equidistributed in G/T. □ 
15. Equidistribution of polynomial subsequences via Weyl's inequality 



With the help of the quantitative Leibman theorem (Proposition 14.3), which reduces 
questions about the equidistribution of polynomial nilsequences to questions about the 
equidistribution of polynomials taking values in M/Z, we analyse in this section the 
distribution of polynomial subsequences of polynomial orbits. 

The first result states that on the torus polynomial subsequences of 5-equidistributed 
sequences are equidistributed too. Before stating this proposition properly we give an 
informal description of its contents here. A polynomial : Z — t- M/Z is equidistrib- 
uted if and only if one of its coefficients is irrational. Quantitative equidistribution is 
an assertion on whether or not there is a Lipschitz function F : M/Z — )■ C for which 
\^n^NF{g{n)) — Jjgy^ F\ fails to be small. Approximating the Lipschitz function F by a 
Fourier series, one sees that studying this quantity is equivalent to studying the expo- 
nential sums E,n^]^e{ug{n)) for certain rational uj. The latter is naturally approached 
by Weyl's inequality which then shows that the quantitative equidistribution of g is an 
assertion about whether or not there is a coefficient of g that is not close to a rational 
with small denominator. This rational approximation property is preserved when we 
consider compositions g o P of g with an integral polynomial P whose leading coeffi- 
cient is not too large. To see this we only need to consider the case where g has a 
'highly irrational' coefficient. Take the largest-index coefficient of g which is 'highly 
irrational' and call it Pi^. Then we may check that the largest-index coefficient oi g o P 
which arises from the highly irrational coefficient of g is still considerably irrational. 
(Some bounds on the lower coefficients of P are needed in order to avoid cancellation.) 

Proposition 15.1 (Equidistribution of polynomial subsequences: Abelian case). Sup- 
pose that (7 : Z — 7- M zs a polynomial of degree d and that P{n) = X]f=o^*'^* ^ 
polynomial with integer coefficients of degree d' such that the leading coefficient 7^' is 
bounded by Lq, while all other coefficients satisfy the inequality 7^ ^ A^'^'^ . Let 
< 5 < 1/2 and suppose 5"* -Ct A^ for allt &N. Then there is some integer A = 0^(1) 
such that when {g{n) (modZ))„g[Ar] is totally 5 -equidistributed and when Lq ^ 5~^^^, 
then {g o P{n) (modZ))^gjjyi/d'] is totally 5^^^'^''^'^'''^ -equidistributed. 



Proof. Since g is totally (5-equidistributed, Proposition 14.2[ b) implies that there is an 



integer A' = Od(l) such that no cZ-tuple of positive integers ki, . . . ,kd satisfies simultan- 
eously kj < S-^"'"/^' and \\kjPj\\ < 5-"^''' /N^ for all j = l,...,d. We deduce that 
there is some index among them such that ||fcjo/3jo|| ^ 5^^'' '"M'jy-io (iocs not hold 
for any ki^ <^ 'oM'_ Suppose io is maximal with this property. Then for all i with 
io < £ ^ d we find K£ <^ S^^ such that 

^ b-^'-'I^N-' . (15.1) 
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For any j G {1, . . . ,d}, considering the jth term of 

d 

Y,PAnn)r =goP{n) 

we have 

where Qj{n) is a polynomial of degree ^ jd' — 1 such that the coefficient of n* for any 
i is bounded by Od4'{N^ N^'-/'^' S'^/"^) since 

d' 



t=i 



E 



{ti,...,ij)e[<i']J 



and 



Define cxj, i = 0, . . . , dd' , to be the following coefficients 

dd' d 

Comparing coefficients, each aj may be written as a linear combination of /3j with 
j ^ i/fi'; cTjrf/ is the a-coefficient of largest index whose representation in terms of /3's 
contains which appears with coefficient (7^')"' the representation. 

Next, we aim to show that there is A" = Od^d'i^) such that every choice of /ci, . . . , kdd' 
with kj ^ (5-2'*'' ^Qj. gac]^ j ^ {!,..., dd'} contains some kj^ such that 



This, when applied with kj = qk for any k ^ 6 ^M", would in view of (14.2) imply 
\\kjg o P||(^oo[^i/d'] ^ from which the result follows by Proposition 



14.2 



a . 



We will show that we can pick jo = iod'- Thus, suppose for contradiction that 

||fc.o,.a„,,|K'5-'"'""''/^"iV-^° (15.2) 
holds for some ki^d' ^ ^-'^'^'^ '""^ M"_ ^Q^g 

kiod'(Jiod' = ki^d'ild'Y" Pio + "^ki^d'CtPt , 

£>io 

where the Ci are integers of order Od4'{N^~''°5~'^/'^) as can be deduced from the equation 



7ti • • • lu 



ti,...,tieK] 

<iH Hi=iod' 



We wish to discard all the terms with I > i^'ui the above expression for kif^d'O'igd' in order 
to deduce that is well approximable by rationals which will hopefully lead us to the 
sought for contradiction. Thus, in view of (15.1), we multiply the above expression for 
kiod'CTiod' by K := Yli,^^^ ki. Inequahty ( |15.2D yields 



\^^h,d'(y^,d'\\ « r2'''-"»''/A"5-(20+...+2'*-o-i)M'^- 



5- 
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Writing k = h^d' {l d'Y^' n<!>io ^^^^ 

e>io i>io 



where in view of (15.1) and the bound on the Ci 

_ ^-2<*'*'-*0'*'/A"^-(2'*-*0-l)/A'^-d/A^-io 

Recalhng that 5"* <^t N for all t G N, this upper bound is seen to be o(l). Together 
with the bound on H/tA^iod'Ciod'H this allows us to employ the triangle inequality provided 
is large enough that no wrap-around issues can occur. In particular, this allows us 
to deduce that 

Choosing A" = 2'^'^'+^A' and A = 2dA' (to ensure that ^ b^'^l^ ^ (^-1/(2^')) this 
translates to 

while we obtain the following bound on k 

Hence, we obtained a contradiction to the rational non-approximability properties of 

Ao- □ 

Next, we slightly extend this result. Consider the binary quadratic form f{x,y) for 
fixed y and its restriction to subprogressions modulo q in the x variable: 

f{qx + r,y) = aq^x^ + x{2aqr + hqy) + (ar^ + hry + cy"^) . 

This defines a quadratic polynomial P{x) := 72^^ + 71X + 70 := f{qx + r, y) in x. Being 
interested in {x,y) such that f{x,y) ^ A^, we may suppose that y <^ N^^'^. Further 
assume that q is /c-smooth (we will be interested in the case q = W) and satisfies 
q <^ N°^^\ Then the coefficients of this quadratic polynomial in x have the following 
properties. 72 is fc-smooth, and 72 < g^A^^^"^)/^ 71 < g^A^(2-i)/2^ ^ gOjY(2-o)/2_ 

The proposition below is tailored to address polynomials with these specific proper- 
ties. 

Proposition 15.2. Let < 5 < 1/2 and suppose 5^* <tit N for all t E N. Let k 

be a positive integer and suppose that the polynomial sequence : Z — M, g{n) = 
Y2j=o(3jn'^ hO'S the property that for every k-smooth integer q, q ^ N°^^\ for every 
choice of ki, . . . , kd with < kj ^ 5^'^'^ ^ for j = 1, . . . ,d, and for every sufficiently 
large N, we have 

sup ykjPj\\6^'''{N/qy ^ 1 . 

Then, if P{n) = XliLo^*^* ^'^ integer- coefficient polynomial of degree d' whose leading 
coefficient is a fc-smooth integer satisfying 7^/ < N°^^\ while all other coefficients satisfy 
the inequality 7^ ^ A^^'^ -^)/d ^^,^1'^ ^ hjq obtain a conclusion similar to the one in the 
previous proposition: 
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Then there is a k-smooth number q, q <^ N°^^\ such that each of the sequences {g o 
P(gn + r)(modZ))„g[(^y^^^-d/-)i/d'j forr G [q] is totally S^^'^'^-'^'^^^ -equidistributed, provided 
N is large enough. 

Remark. The unconventional form of the inapproximabihty conditions imposed on the 
Pi comes out of our choice of major and minor arcs; cf. Proposition 15.4 and the next 
section. 

Proof. Consider q := 7^^/, and let, as in the previous proof, i^ be the maximal index for 
which 

U^'kM\5''-''iN/qy-:^i 

for all kig ^ 6~'^'' Thus, for i > io there are ^ * such that 

\\q'KM'^6'''~\N/q)-' . (15.3) 

We wish to employ this information to proceed as in the previous proof, that is, we 
wish to assume for contradiction that all coefficients oi g o P are close to rationals. In 
particular this would apply to the {iQd')-i]i coefficient. Writing that coefficient as a 
linear combination of /3's we would like to deduce that then has to be close to a 
rational, which produces a contradiction. Unfortunately, the above information is not 
quite sufficient for our purposes yet: we require similar bounds on instead of 

on llg^K^/J^II. To work around this, we pass to higher powers of g, aiming to find 
a small t and an index it such that \W'^'''^ki^(3i^\\ ^ S^"^ (A^/g*^*)"**, but for i > it 
\\q^^ke(3e\\ ^ {N/q^)~^. The gap between g**^*' and g*^ (for £ > it) introduced by 

the extra factor d will be sufficient to analyse g o P on subprogressions modulo g*'^/'^ . 
Returning to the proof, note that (15.3) implies 

for i > io and for all positive integers t. By assumption on the rationality properties of 
the j = 1, . . . ,d, there is an index ii, which by the previous observation necessarily 
satisfies ii ^ io, such that 

\\q'''hM\6''-'\N/q'r:^l 

for all ki^ ^ 

Proceeding like this, we obtain a decreasing sequence io ^ H ^ ^2 ^ ■ ■ ■ of positive 
integers such that for every j the following two families of inequalities hold: 

||g(^'+^)^^A:,/,J»r2"'^(iV/g(^-+i))-^- 

for all ki. ^ ^ , and for every i with d^ i > ij there is ne ^ ^^^'^ ' such that 

\\q^'+'^'Ke/3e\\ <^6-^"'\N/q(^+''>r' . 

By positivity of the indices ij, there is t = Od^d'i^) such that it-i = it = it' for all 
t < t' ^ tdd'. Setting r = tdd', we therefore have 

||g(i+^)^*A;,,AJ|52^-"(iV/gi+^)^* » 1 (15.4) 

for all fcjj ^ 6^'^'' '\ while we find for every £ > it a. positive integer ki ^ 5"^'' ' such 
that 

\\q''Ke/3e\\ <t: 6-^"\N/q')~' . (15.5) 
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Now recall that d! = deg(P) and consider the sequence g{P{q^/'^' n + '"))„g[(7v/g^+i)i/d'] 
for an arbitrary r G [q^^'^']. 
Defining coefficients (Xj by 

dd' d 

a^n' = g{P{q^''^'n + r)) = J] Pj{P{q^/^'n + r)y , 
1=0 i=o 

we have 

with integer coefficients C^. We need a bound on and proceed to show that = 
0(A^^~*'g**). Expanding out products yields 



d' 



rV 



oi,...j,)G[dr {uu...,ueK ^ ^ ^ ^ 

Consider any term involving (g'^^'^'n)*''^' = g^'n***^'. 

If ji + ■ ■ ■ + je = ui + ■ ■ ■ + ue = ltd! , then the coefficient of q'^'^^ri'^'^ is . . . 7^-^ ^ 
A^^'**g*'. If ji + ■ ■ ■ + j£ > ui + ■ ■ ■ + = then the coefficient of [q^nY^'^ is bounded 
by Orf,rf'(7j, ...7j,r"') = Od,d'(^^"*'"^^^'^'^A^'^') = 0{N^-^'), since r < g^/'^' < iV°(i). 
Thus in total, Ct = 0{N^-''q''). 

We return to analysing the rational approximations of the individual terms of (15.6). 
Notice that t ^ ti for all £ G [d]. Thus (15.5) guarantees for i > it the existence of 
i^e ^ such that 

< r2'''Ar-^g"'Q < 6-^'''N-''{q^+'y' (15.7) 

holds. 

We are finally in the position to show that there is A = Od,d'{^) such that 

g{P{q^l'^'n + r))„g[(^/g.+i)i/d'] 

is totally ^^/'^-equidistributed. More precisely, we show that there is A' = 0^,^/(1) such 
that for every ki^d' ^ 5-2'''' /A' 

holds true. From here the result follows from Proposition 14.2K a). Suppose for contra- 
diction that 

\\kud'a..d'\\ < {N/q^^'r^6-'''''-'''/^' 

for some K := Kd ■ ■ ■ K,it+i {oT K = 1 when the product is empty), 

then, since K£ ^ , 

\\nk,td'cr.,A « (W+i)--r2''''''-''VA'5-(2-H-i) _ 
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Considering the summands in (15.6), the bounds (15.7) imply 

Appeahng to the assumptions that both q and are bounded by N°^^\ the above is 
seen to equal 0{N~^'^°^^^) = o(l) since it ^ 1. Thus, provided N is large enough, no 
wrap-around issues appear when examining the circle norm ll/cit^'KCTi^d/ 1| and we find the 
following statement on rational approximation of Pi^ 

\\kitd'Kl3i,{-fd'q^y'\\ = Wkitd'f^PiAl^^^Y'W 

= Wki.d'HCTi^d' - ^/3£/tg"*C<;|| 

i>it 



^ \kitd'Kai,d' \ + ^ \(3iKq'^''Ci 



e>it 

^ ^-(2d-H^l)^-2'''(''-H)/A'p^-it(^^T+lyt _ 

Choosing A' = 22^'^^~^^\ this shows that there is k, namely k = ki^d'i^ci^\ bounded by 
5"^'* such that 

contradicting (15.4). □ 

Combining either of the previous two results with the quantitative Leibman theorem, 
the general case of the equidistribution theorem for subsequences follows. 

Proposition 15.3 (Equidistribution of polynomial subsequences). Let N,d,d' be pos- 
itive integers, and let Lq and 6 G (0, 1/2) be parameters, and suppose that for 
all t G N. Let g G poly(Z, G,) be a polynomial sequence of degree d and suppose that 
the finite orbit {g{n)r)n^[N] is totally 6-equidistributed in G/T. Let P : Z — t- Z 6e an 
integer- coefficient polynomial of degree d' whose coefficients are bounded by Lq. Then 
there is some A = Od,d'{^) such that whenever Lq < 6, then the polynomial subsequence 
{{g o -P)('^))„g[(7v/^^,)i/<i'] is totally 5^^'^'^-'^'^^^ -equidistributed on G/V . 

Proof. We first pass to the abelian setting: by Proposition |14.3[ there are constant 
A, A' = Om,d{l) such that every sequence {rj o (y'(n))„g[Ar] for a horizontal character rj 
of modulus at most is totally (J^/'^'-equidistributed. Applying Proposition 15.1 we 



deduce that for each such character rj the sequence (rj o g o -P(?^))„g[(7v/^^,)i/d'] is totally 
(5^/'^'*'<*' ™*-^^-equidistributed in M/Z. An application of the other direction of Proposition 



14.3 then allows us to return to G/T and deduce the stated equidistribution property 



of of {g o ^(^))n6[(^/^^,)i/d'] in G/T. □ 

Similarly, Proposition 15.2 results in an assertion for polynomial orbits on general 
nilsequences: 

Proposition 15.4. Let N,d,d',k be positive integers, and let 6 G (0, 1/2) be such that 
A^ for all t G N. Let g G poly(Z, G,) be a polynomial sequence of degree d and 
suppose that for every k-smooth number q, q <^ N°^^\ the sequence {g{qn)T)n<^[]\f/g-\ is 
totally S-equidistributed in G/T. 

Suppose further that P : Z ^ Z is an integer- coefficient polynomial of degree d' as in 



Proposition 15.2. That is, if P{n) = ^^=oT*^*' then 7^/ is a k-smooth with 7^/ < N''^'^\ 
while all other coefficients satisfy the inequality 7^ ^ ]\j'id-'-'>-)/^'^^,^/d-\ 
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Then there is a k-smooth number q, q <^ N°^^\ such that each of the sequences 
{g o P[qn + '")r)„g[(Ar/^^,gd')i/<i'] for r G [q] is totally 6^^'^'^''^'^^^ -equidistributed in G/T, 
provided N is large enough. 

Proof. Let rj : G/T — )■ M/Z be an arbitrary non-trivial horizontal character of modulus 
bounded by §^'^"^•'1^^^ and suppose that rjog has the polynomial representation y]og{n) = 
Yl'l=ol^3^^ M/Z. Let q, q ^ N°^^\ be /c-smooth and consider the sequence 

(r/ o g{qn)T)n^[N/q] ■ 



By the equidistribution assumption on the subsequences of g, by Proposition |14.3| and 
by Proposition 14.2K b), there is an integer B = 0^(1) such that for every choice of 
ki, . . . ,kd with < kj ^ ^ ior j = 1, . . . , d, and for every sufficiently large N, we 
have 



sup ||g^fc,/3j||52'"'/^(Ar/g)J' ^ 1 



Thus, with (5^/^ in place of S, the conditions of Proposition 15.2 are satisfied and hence 
there is g <^ such that for every r G [q] the sequence 

(riogo P{qn + r) (mod Z))„6[(7v/7,,g)i/^'] 

is totally ^^^'^'^-'^''-^''-equidistributed in M/Z, provided is large enough. An application 
of Proposition 14.3 to get back to G /V gives the result. □ 



16. The factorisation into minor and major arcs 

In view of the previous section, a 'minor arc sequences' g G poly(Z, G,) should satisfy 
the conditions of Proposition |15.4| in order to guarantee its applicability. That is, 
given A; G N, 5 = 5{N) G (0,1/2), and R <ti N"'^^\ the sequence g should have the 
property that for every fc-smooth number q ^ R the finite sequence (fi'(g^^)^)nG[iv/g] is 
5-equi distributed in G/T. 

In this section we will achieve a factorisation of an arbitrary polynomial sequence g 
into a product eg''j, where e is slowly varying ('smooth'), 7 is periodic with a fc-smooth 
common difference, and g' has the 'minor arc property' described above. We will ensure 
that g' satisfies a slightly stronger version of this: when we restrict g' to subprogressions 
on which 7 is constant and on which e is almost constant, then the restricted sequence 
still enjoys the 'minor arc property'. 

This factorisation will be obtained by iteration of the Green- Tao factorisation theorem 
[Tin Thm. 1.19] employing its dimension reduction as a guarantee for termination of the 
iteration. Before we state the factorisation theorem, we recall the notion of smoothness 
of sequences. 

Definition 16.1 ((M, A^)-smooth sequence, Def.1.18). Let G/T be a nilmanifold 
with Q-rational Mal'cev basis X and metric d = dx. Let {e{n))ni^z be a sequence in G, 
and let M,N ^ 1. Then e is said to be {M, N) -smooth if both d{e{n), idc) ^ M and 
d{e{n),€{n — 1)) ^ M/N are satisfied for all n G [A^]. 

In the later iteration of the Green- Tao factorisation theorem we will encounter a 
product of smooth sequences, which needs to be shown to be smooth itself. Notice 
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therefore that, when {e{n))n(zz is (M, A^)-smooth and when (e'(n))„,gz is {M,N/q)- 
smooth, then the triangle inequahty and right-invariance of the metric d yield 

d{e{qn + j)e'{n), idc) ^ d{e{qn + j), idc) + d{e\n), idc) ^ 2M 

for all n G [N/ q] . Employing also the approximate left-invariance of d (see [TUt Lemma 
A. 5]), we obtain 

d{e{qn + j)e'{n),e{q{n - 1) + j)e'{n - 1)) ^ 2qQ^^^^M/N . 

Thus, {e{qn + j)e'{n))nez is (2Q^(i)M, A^/g)-smooth. 

The tool to split into major and minor arcs is the following Green- Tao factorisation 
theorem. 

Theorem 16.2 (Green-Tao, Thm 1.19 [lO]). Let m,d ^ 0, and let Qo,N ^ 1 and 
A > be real numbers. Suppose that G/T is an m- dimensional nilmanifold together with 
a filtration G, of degree d. Suppose that X is an Q^-rational Mal'cev basis X adapted 
to G, and that g G poly(Z, G,). Then there is an integer Q with Qo ^ Q ^ Qq ''^'"''''^^\ 
a rational subgroup G' G, a Mal'cev basis X' for G' /V in which each element is 
an Q-rational combination of the elements of X , and a decomposition g = eg''y into 
polynomial sequences 7 G poly(Z, G,) with the following properties: 

(1) e : Z ^ G is {Q, N)-smooth; 

(2) g' : TL ^ G' takes values in G' , and the finite sequence {g' {n)V')n(z[N] is 1/Q^- 
equidistributed in G' /V , using the metric dx' on G' /T' ; 

(3) 'y : Tj ^ G is Q-rational, and (7(n)r)„gz is periodic with period at most Q. 

The proof of our modified factorisation theorem will proceed via an iterative applic- 
ation of the theorem stated above. Our next aim is to prove an auxiliary lemma which 
will guarantee that the iteration process stops after finitely many steps. The way this 
goal is attained is to ensure that every time we refine our splitting of [N] into subpro- 
gressions the polynomial sequence g we try to factorise fails to be totally equidistributed 
(with some parameter) on each of the new subprogressions. This way an application of 
the factorisation theorem on any new subprogression yields a lower dimensional rational 
subgroup. 

Lemma 16.3. Let G/T be an m- dimensional nilmanifold and let g G poly(G',,Z) be a 

polynomial sequence of degree d. Let 5 G (0, 1/2) be such that for all t G N. 

Further let a ^ 5^^ be an integer, and 6 G [a]. Suppose that {g{q{an + b))T)ne[N/q] fails 
to be 6 -equidistributed in G/T for some q ^ N°^^\ Then there is some B = Om,d{i) 
such that each of the sequences {g{n{aqY + r)r)„g[jv/g<*] for r G [(aq')°'] fails to be 6^- 
equidistributed in G/T. 



Proof. By Proposition 14.3 and Proposition 14.2K a), there is a non-trivial horizontal 



character r] of modulus bounded by such that the function h : Z M/Z defined 

by h{n) := r] o g{q{an + h)) satisfies ||/i||c°°[Af/aq] < Let rj o g{n) = Yl%oPj^^ 

and 7] o g[q[an + h)) = Yl'j=o ^i^^ polynomial representations in M/Z. Then 



sup ii^j 
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Since 



a,- = P,{aqy + (^\p,{aqy {bqf-^ , (16.1^ 



we find, using a downwards induction starting with j = d, that 

\\(3j{aqf\\ < r^'"-''WAr-^'(ag)'^ = o(l) . 

Indeed, for j = d the assertion is immediate. Suppose now it holds for j G {jo + 1, . . . ,d} 
for some jo ^ 1. We proceed to check the case where j = jo by analysing ( |16.1[ ) for 
j = jo, multiplied through hj t = [aqY~^°. Observe that for all positive integers t and 
for alH G {1, . . . , d} 

By the assumptions on 5, a and g, this bound is o(l) when t = {aqY~^° and i = jo- 
Similarly, we have by induction hypothesis for i G {jo + 1, ■ ■ ■ ,d} and all t 

\\(3e{aqyt\\ t\\(3e{aqY\\ <^tN~\aqY5-°''-^'^^^ , 

which certainly is o{N~^^^{aqY5~'~''^'"'^^'') if we set t = {qbY~^°. This allows to apply 



the triangle inequality to split up ||crjo('^?)'^~''°ll manner of (16.1) to deduce the 

assertion for jo- 

Next, pick r G [(aq')'^] and define 5"o, . . . , such that r] o g[[aqYn + r) = X]j=o ^j^"'; 
then 

Since jd ^ c? for all j G {1, . . . , c?}, we have for each of the summands 

By the assumptions on 5 and g, this bound equals o(l) and hence we can apply the 
triangle inequality to split up ||5"j||: 

d 

By Proposition 14.2K b) and Proposition 14.3, this implies the result. □ 



Now we finally turn to the modified factorisation theorem which gives the correct 
type of minor arcs. 

Theorem 16.4 (Modified factorisation theorem). Let m,d, N, A ^ 1 be integers, and let 
k,Qo,R ^ 1 be integer parameters. Suppose that G/T is an m- dimensional nilmanifold 
together with a filtration G, of degree d. Suppose that X is an Qo-rational Mal'cev 
basis X adapted to G, and that g G poly(Z, G,). Suppose further that Qq ^ log/c and 

k,R = 0(iV°(i)). Then there is an integer Q with Qo ^ Q <^ Qq^'"^''^^^\ and a partition 
of [N] into at most disjoint subprogressions P, each of length at least N/W^"^ 
and each of k-smooth common difference bounded by R^'"^ such that the restriction of 
{g{n))n^P to any of the progression P can be factorised as follows. 
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There is a rational subgroup G' ^ G, depending on P and a Mal'cev basis X' for G' /T' 
such that every element of X' is a Q-rational combination of elements from X (that is, 
each coefficient is rational of height bounded by Q). Suppose P = {n = r (modg)}, 
then we have a factorisation 

g(qn + r) = ep{n)g'p{n)-fp{n) , 

where ep,g'p,jp are polynomial sequences from poly(Z, G.) with the properties 

(1) Ep : Z ^ G is (Q, N/ q) -smooth; 

(2) g'p : Tj ^ G' takes values in G' and for each k-smooth number q ^ R the finite 
sequence {g'p{qn)T')n^N/{qq) is totally Q~^-equidistributed in G' /V ; 

(3) '^P : Tj ^ G is Q-rational and (7j(n)r)„g2 is periodic with a k-smooth period 
which is bounded by R'^'^Q. 

Proof. We may suppose that g does not satisfy (2), that is, there is some /c-smooth 
integer qi ^ R and 6i < ai ^ Qq such that {g{qi{ain + bi))T)n^N/qi fails to be Qq^- 
equidistributed. Writing zi := {aiqiY, Lemma 16.3 imphes that each of the sequences 



{g{zin + ri)T)n^M/z-i with ri G [zi\ fails to be Qq"^"^ -equidistributed for some A' = 
Om,d{^)- Now, we run through all ri G [zi] one after the other. 

Applying the factorisation theorem in its original form to any of these sequences 
yields some Qi <^ Qq^^'"^''^\ a proper Qi-rational subgroup Gi < G of dimension 
strictly smaller than m, and a factorisation 

g{zin + ri) = {n)g'^._^ (72)7^, (n) 
where the finite sequence {g'r^{n)Ti)n^N/zi is totally Qr'^-equidistributed in 

Gi/Fi := Gi/(rnGi) . 

If gl.^ is Q^'^-equidistributed on every subprogression {n = 62 (mod 0252)} of fc-smooth 
common difference 02^2, where 62 < ^2 < Qi and q2 < R, then we stop (and turn to the 
next choice of ri). Otherwise, invoking Lemma 16.3 again, there is a /c-smooth integer 
a2q2 as above such that with Z2 := (ci2'?2)'^ the finite sequence (5'ri,r2(^))n^Ar/(ziZ2) defined 
by 5'ri,r2(^) '■= g'r^{z2n + r2) is not Q^'^-equidistributed for any r2 G [2:2] • We proceed as 
before. 

This process yields a tree of operations which has height at most m = dimG, since 
each time the factorisation theorem is applied a new sequence g[._^ is found that 
takes values in some strictly lower dimensional submanifold Gj = Gj(ri, . . . , rj) of 
Gj_i(ri, . . . ,rj_i). Thus, we can apply the factorisation theorem at most m times in a 
row before the manifold involved has dimension 0. 

The tree we run through starts with g, which has Zi neighbours gr^, one for each 
ri G [2:1]. Each gr^ has Z2 = Z2{ri,r2) neighbours gri,r2y one for each r2 G [Z2], etc.. 

As a result, we obtain a decomposition of the range [N] into at most i^^rfm g^i^p^-g- 
gressions of the form 

P = {zi{z2{z3{. . . {ztiTL + rt) . . . ) + rs) + r2) + ri : m < N/{ziZ2 . . . Zt)} 
= {ziZ2 . . . Ztm + r : m ^ N/ {ziZ2 ■ ■ ■ Zt)} , 

for some r, and where each Zi depends on ri,...,rj_i. The common difference of 
such a progression P is fc-smooth and bounded by R'^'^"^. Thus, P has length at least 
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jsi^ jl'^dm _ o„(i)^ rjj^g iteration process furthermore yields a factorisation of gri,...,rt^ 
which is the restriction of g to P: 

fi'ri,...,n("i) = giziZ2 ...ztm + r)= er^^...,rtim')9tim)lr^,...,rtim) , 

where 

£ri,...,rt("^) = er^{z2 . . . ZtTTl + ■ ■ ■ £ri,..,rt_i (^^t^ + rt)£ru-,rti^) 

for certain integers f2, rs, . . . , ft, and 

7ri,...,n (m) = 7ru-,rt ("i)7ri,...,rt_i {Ztin + ft) ■ ■ ■ 7ri {Z2 ■ ■ ■ ZtlTl + fg) . 

In view of the remarks following the definition of smoothness of sequences, the factor 
(Q^^"*""^'\iV/(^i...Zt))-smooth sequence. Further, the periodic se- 
quences 7ri,...,ri("^) are easily seen to have a Qq^''* ™ '■^''-smooth, i.e. fc-smooth, period. □ 

17. Reduction to the case of minor arc nilsequences 



With the help of the modified factorisation theorem. Theorem 16.4 , we will show that 
the general non-correlation estimate follows from the special case of non-correlation with 
'minor arc nilsequences' that enjoy property (ii) of the modified factorisation theorem. 

The general case is the following proposition. 

Proposition 17.1. Let G/T be a nilmanifold of dimension m ^ 1, let G, be a filtration 
of G of degree d ^ 1, and let g G poly{Z, G,) be a polynomial sequence. Suppose 
that G/T has a Q-rational MaVcev basis X for some Q ^ 2, defining a metric dx 
on G/V . Suppose that F : G/V — )■ [—1, 1] is a Lipschitz function. Then we have for 
Mo = log log log AT and N' = [N/W\ 

|E„e[^,](r},^(n) - l)F(^(n))r| gO-''->^(i)(l + ||F||)Mo-^ 

for any A > and N ^ 2. 

Similarly as in §2 of [9], we will deduce this result from the following special case 
involving only 'minor arc nilsequences'. 

Proposition 17.2 (Non-correlation, equidistributed case). Let N > be a large integer 
and let 6, k and R be parameters such that 6 G (0, 1/2), 5^* N' for alltEN,R'^ 
N"^^^ and k = w{N). Suppose that {G/T,dx) is an m- dimensional nilmanifold with 
some filtration G, of degree d and suppose that g G poly(Z, G,). Suppose further that 
for every k-smooth number q ^ R the finite sequence {g{qn)V)ne[N' /q] is 5 -equidistributed 
in G/V. Then for every Lipschitz function F : G/T — )■ M satisfying J^^^ F = and for 

every k-smooth number q ^ A^°(^) and every r E [q], we have 

\Ene[N']ir'j^f,{qn + r) - l)Figin)T)\ < 5'\\F\\ 
for some c such that c^^ = 0^,^(1)- 

Proof of Proposition \17.i assuming Proposition 17.^ Observe that A^' = N^~°^^\ We 
may assume that Q ^ Mq, thus Q ^ Mq = log w{N). The modified factorisation 
theorem can now be applied to the sequence {g{n)T)ns^N with the following parameters: 
k = w{N), Qo = log w{N), R = A^°(^). This yields a partition of [A^'] into at most R^""^ 
progressions of w(A^)-smooth common differences. By the triangle inequality, it suffices 
to show that 
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for every progression P in the partition. 

For each of these progressions P =: {qpfi + rp}, the modified factorisation theorem 
provides us with a factorisation of the restriction of g to P: 

g{qpn + rp) =: gp{n) = ep{n)g'p{n)'yp{n) , 



where ep,g'p,'~fp satisfy {{), {ii) and [iii) from Theorem 16.4 Proceeding as in Pl §2] 
(see loc. cit. for full details), we spht each P into subprogressions P = PiU ■■■ U Pt in 
such a way that 

• '~fp{n) is constant on each progression, say 'Jp{n) = for n G Pj, and 

• ep{n) is almost constant: to be precise, the Pj are such that \n — n'\ ^ N'/{qQ^) 
for some B = 0(1) and all n,n' G Pj which implies d{ep{n),ep{n')) ^ Q~^^^ 
by smoothness of £:p. 

From each Pj, we choose a fixed element, say rij. Then the Lipschitz property of F, 
right-invariance of the metric, and smoothness of ep imply that for every n G Pj 

\F{epin)g'pin)^{n)T) - F{ep{nj)g'pin)^,T)\ ^ Q-^/' , 

provided B was chosen large enough. Hence it suffices to show that 

|E„eP,(r;,/3H - l)F{epinj)j,{^-'g'pin)jj)T)\ «^,,,,,^ qO^.,..Ai)(^i + ||F||)Mo-^ . 



The aim is now to apply Proposition 17.2 to j : Z — t- 'J, ^G'jj =: H 



9jin) ■■= Ij g'piqp.n + rp^)-fj . 

Property {ii) of the modified factorisation theorem was set up in such a way to en- 
sure that gj still enjoys the 'minor arc property' (on Hj/{r fl Hj) rather than G/T of 
course). Note that the Lipschitz constant of Fj : Hj/(T H Hj) — )■ C, Fj{x(T H Hj)) : = 
F{ep{nj)'yjT) is bounded by M||F|| by [IHl Lemma A. 16]. Si nce P, has a w(A^)-smooth 



common difference and length at least Proposition 7.4 implies that (r/^/j — 1) 

does not correlate with any function n clp.{n), where c is a constant. Hence we can 
subtract off the mean value of Fj and reduce to the assumption f^j.^j^, Fj = 0. 

All remaining technical details work exactly as in [9, §2 and App.B], so we have 
chosen, given their technical complexity, to omit them here. □ 

18. Completion of the non-correlation estimate 



We complete the proof of Proposition |17.2| and therefore the analysis of correlation 
of r'jr 1^ with nilsequences. Recall the conditions of Proposition 17.2 In particular, we 



are given a polynomial sequence (5'(n)r)„g[Ar/] such that for every w(A^)-smooth number 
q ^ R the finite sequence {g{qn)r)n£[N'/q] is (5-equidistributed in G/T. The parameter 



6 satisfies the condition A^', which will allow us later to apply Proposition 15.4 

We are required to show that for every Lipschitz function F : G/P — ?■ M satisfying 
Jg^P F = 0, for every w(A^)-smooth number q ^ A^"'^^-', and for every r G [q], we have 

KelN'M{r'f,p{qn + r) - l)F{g{n)r)\ « ^Vo^.^d) ||^|| _ 

By 5-equidistribution of ((7(n)r)„g[jv] and since / F = 0, it suffices to show that 

\Ene[M'/,]r'f,p{qn + r)F{g{n)T)\<^6'/''-"'^'^\\F\\ . 
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Figure 1. Schematic of how the summation is spht: we sum along hori- 
zontal lines {y fixed), along vertical lines {x fixed), and another time over 
the segments of the horizontal lines that are contained in the 'box', that 
is, over the double-counted segments. 



We may suppose that / = (a, 6, c) has reduced form, that is |6| ^ a ^ c. Writing 
X{f,N) := {{x,y) : f{x,y) ^ A^}, our aim is to decompose the binary sequence 

{giiax"^ + bxy + cy'^)T)}(^^y)^x(f,N) 

into a sum of polynomial subsequences (5''(P(n)r))„^(^/)i/dog(p) of some equidistributed 
sequence {g'{n)T)n<^N'- In order to do so, let (xo,yo) ^ be the point on the ellipse 
f{x,y) = ax"^ + hxy + cy'^ = N that satisfies xq = ~ A^^^^. Since / has reduced 
form, both axg ^ and cy^ ^ hold. With respect to {xo,yo), the summation over 
{x,y) e X{f,N) now splits into three parts (cf. Figure [T]) such that on each part one 
of the variables x and y may be fixed, while the free variable will range over an interval 
of length at least Xq ~ A^^/^. This decomposition yields 



27r 



r'fA^n + r)F{g{n)r] 

n^{N'-r)/q 



+ 



W J 

w 
w I 



-1 



f{x,y)=Wr+l3 (modWg) 

y^yo x:f{'x,y)^N 



'^f(x,y)=Wr+l3 (modWq) 

y:f{x,y)^N 



E El 

y^yo x^xo 



f{x,y)=Wr+l3 (raodWq) 



Fig 



f{x,y) - (3 -Wr 
Wq 

f{x,y) - f3 -Wr 



r 



Wq 

f{x,y) - ^ -Wr 
Wq 



:i8.i) 



To remove the congruence condition f{x, y) = Wr + fi (mod Wq) in this explicit form, 
we consider the set S{qW^Wr + (5) of all solutions {x',y') G [qW]^ to the congruence 



f{x',y') = Wr + (5 {uiodqW). By Corollary 6.4 the density of these solutions for a 



w(A^)-smooth integer q, r G [q] and j3 E A satisfies 



qW 



W 
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To simplify the notation, define (3' := Wr + /3 and q' := Wq. Considering any of the 
three parts of our summation above, we may continue this as follows 



w 

qW E(a,/y)g5(g/_^/) 



X] 5Z ^f{x,y)=l3' {modq')F[g 
y^yo x:f{x,y)^N 



fix,y)-/3' 



r 



y.q'y+y'^yo 



x:f{q'x+x',q'y+y') 



f{q'x + x',q'y + y')-P' 
q' 



r 



Observe that 



f{Wqx + x', Wqy + y') - Wr - (5 



Wqax^ + h'x + c 



for some 6', c' dependi ng on y', x', 6, c, g and W ^ is a polynomial that satisfies the con- 
ditions of Proposition 



15.4 



Thus, setting P{x) := Wqax"^ + h'x + c', we are considering 
the polynomial subsequence {g o P(n)r)„^(^//gg/)i/2 of {g{n)V)n^]^i /q. By Proposition 
there is for each P a w(A^)-smooth integer q <^ A^°(^) such that for every f G [q\ 



15.4 



the sequence 

g{P{qx + ^))2:^Ari/2/(|ygg) 

is totally 5^/'^'**^^^-equidistributed. Splitting the summation into subprogressions modulo 
g, we have via the triangle inequality 



qW ¥.(^^>^y,)(,s{q>,i3') 

y-q'y+y'^yo 



J2 F{goP{x)T) 



x:f{q'x+x',q'y+y') 



y:q'y+y%yo f 



E 



F(5f o P(gx + r)r) 



a::/(q'(qx+f)+a;',(j'j;+j;') 



j\ri/2 



y.q'y+y'^yo 



Wqq 



< <5Vo.(i) 



— I 



I Lip 



As these arguments also apply to the two remaining parts of the sum (18.1) this com- 
pletes the proof of Proposition 17.2 and also the proof of the main theorem. 
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